ML
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
🦜🔗 Build context-aware reasoning applications
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
🔊 Text-Prompted Generative Audio Model
React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in
Port of OpenAI's Whisper model in C/C++
Faster Whisper transcription with CTranslate2
The official GitHub page for the survey paper "A Survey of Large Language Models".
A fast inference library for running LLMs locally on modern consumer-class GPUs
State-of-the-art 2D and 3D Face Analysis Project
Stable Diffusion web UI
DSPy: The framework for programming—not prompting—language models
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
Easily use and train state-of-the-art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels (easy/hard) across eight real-life scenarios.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
Whisper real-time streaming for long speech-to-text transcription and translation
On-device wake word detection powered by deep learning
A Python package to build AI-powered real-time audio applications
Instant voice cloning by MIT and MyShell. Audio foundation model.
Dynamic batching library for Deep Learning inference. Tutorials for LLM, GPT scenarios.
An official implementation of Pangu-Weather
Simple text to phones converter for multiple languages