Skip to content
View davidberenstein1957's full-sized avatar
🦦
🦦

Organizations

@Giskard-AI @PrunaAI

Block or report davidberenstein1957

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Hi there πŸ‘‹

From failing to study medicine ➑️ BSc industrial engineer ➑️ MSc computer scientist.
Life can be strange, so better enjoy it.
IΒ΄m sure I do by: πŸ‘¨πŸ½β€πŸ³ Cooking, πŸ‘¨πŸ½β€πŸ’» Coding, πŸ† Committing.

Conferences/Presentations πŸ“–

  • Synthetic Data - Weaviate Podcast #118! - podcast
  • SmolAgents - From Bells and Whistles to Agents and Tools - slides video
  • No data? No problem! - synthetic data to the rescue - slides video
  • Practical AI Podcast - Towards high-quality (maybe synthetic) datasets - podcast
  • Code Together Podcast Intel Software - Scaling LLM Datasets with Less Effort Using Argilla - video
  • Mastering LLMs - Creating, curating, and cleaning data for LLMs - slides video
  • 🧼 From GPU-poor to data-rich - data quality practices for LLM fine-tuning - slides
  • Deeplearning.ai LLM workshop - get started with Argilla for human- and distilabel for AI feedback - video
  • NLP Healthcare Summit 2023 - Smart Shortcuts for Bootstrapping a Healthcare NER Project - video
  • Anyscale Ray Europe Meetup - Smart shortcuts for Bootstrapping a Text Classification project - video

AI Code Content

Employers πŸ‘¨πŸ½β€πŸ’»

Open source ⭐️

Maintainer πŸ€“

Contributions πŸ«±πŸΎβ€πŸ«²πŸΌ

Volunteering 🌍

  • Bonfari - small to medium sustainable scale projects in Gambia πŸ‡¬πŸ‡²
  • 510 red-cross - occasional projects to improve humanitarian aid with data

Contacts

Gmail LinkedIn Twitter

Pinned Loading

  1. argilla-io/distilabel argilla-io/distilabel Public

    Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

    Python 3k 221

  2. PrunaAI/pruna PrunaAI/pruna Public

    Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.

    Python 1k 72

  3. argilla-io/argilla argilla-io/argilla Public

    Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

    Python 4.8k 464

  4. concise-concepts concise-concepts Public

    This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.

    Python 244 14

  5. crosslingual-coreference crosslingual-coreference Public

    A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.

    Python 108 19

  6. spacy-setfit spacy-setfit Public

    This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.

    Python 80 5