Skip to content
View davidberenstein1957's full-sized avatar
🦦
🦦

Organizations

@Giskard-AI @PrunaAI

Block or report davidberenstein1957

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Hi there πŸ‘‹

From failing to study medicine ➑️ BSc industrial engineer ➑️ MSc computer scientist.
Life can be strange, so better enjoy it.
IΒ΄m sure I do by: πŸ‘¨πŸ½β€πŸ³ Cooking, πŸ‘¨πŸ½β€πŸ’» Coding, πŸ† Committing.

Conferences/Presentations πŸ“–

  • Synthetic Data - Weaviate Podcast #118! - podcast
  • SmolAgents - From Bells and Whistles to Agents and Tools - slides video
  • No data? No problem! - synthetic data to the rescue - slides video
  • Practical AI Podcast - Towards high-quality (maybe synthetic) datasets - podcast
  • Code Together Podcast Intel Software - Scaling LLM Datasets with Less Effort Using Argilla - video
  • Mastering LLMs - Creating, curating, and cleaning data for LLMs - slides video
  • 🧼 From GPU-poor to data-rich - data quality practices for LLM fine-tuning - slides
  • Deeplearning.ai LLM workshop - get started with Argilla for human- and distilabel for AI feedback - video
  • NLP Healthcare Summit 2023 - Smart Shortcuts for Bootstrapping a Healthcare NER Project - video
  • Anyscale Ray Europe Meetup - Smart shortcuts for Bootstrapping a Text Classification project - video

AI Code Content

Employers πŸ‘¨πŸ½β€πŸ’»

Open source ⭐️

Maintainer πŸ€“

Contributions πŸ«±πŸΎβ€πŸ«²πŸΌ

Volunteering 🌍

  • Bonfari - small to medium sustainable scale projects in Gambia πŸ‡¬πŸ‡²
  • 510 red-cross - occasional projects to improve humanitarian aid with data

Contacts

Gmail LinkedIn Twitter

Pinned Loading

  1. argilla-io/distilabel argilla-io/distilabel Public

    Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

    Python 2.9k 220

  2. PrunaAI/pruna PrunaAI/pruna Public

    Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.

    Python 905 69

  3. argilla-io/argilla argilla-io/argilla Public

    Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

    Python 4.7k 460

  4. concise-concepts concise-concepts Public

    This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.

    Python 244 14

  5. crosslingual-coreference crosslingual-coreference Public

    A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.

    Python 107 19

  6. spacy-setfit spacy-setfit Public

    This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.

    Python 80 5