Stars
⚡️ GenBI (Generative BI) queries any database in natural language, generates accurate SQL (Text-to-SQL), charts (Text-to-Chart), and AI-powered business intelligence in seconds.
Minimalistic 4D-parallelism distributed training framework for education purpose
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)
Seemless interface of using PyTOrch distributed with Jupyter notebooks
Machine Learning Engineering Open Book
A toolkit to run Ray applications on Kubernetes
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
An intuitive and low-overhead instrumentation tool for Python
Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support
Create committing rules for projects 🚀 auto bump versions ⬆️ and auto changelog generation 📂
verl: Volcano Engine Reinforcement Learning for LLMs
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
how to optimize some algorithm in cuda.
Scalable toolkit for efficient model reinforcement
The simplest, fastest repository for training/finetuning small-sized VLMs.
Minimal and annotated implementations of key ideas from modern deep learning research.
CUDA Python: Performance meets Productivity