Stars
Efficient Triton Kernels for LLM Training
A high-performance algorithmic trading platform and event-driven backtester
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
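A concise and elegant dictionary and translation macOS app. Works out of the box, supports offline OCR, and supports Youdao Dictionary, 🍎 Apple System Dictionary, 🍎 Apple System Translation, OpenAI, Gemini, DeepL, Google, Bing, Tencent, Baidu, Alibaba, NiuTrans, Caiyun, and Volcano translation. A concise and elegant Dictionary and Translator macOS App for looking up words an…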
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
Zero Bubble Pipeline Parallelism
DeepEP: an efficient expert-parallel communication library
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
FlashInfer: Kernel Library for LLM Serving
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Ola: Pushing the Frontiers of Omni-Modal Language Model
verl: Volcano Engine Reinforcement Learning for LLMs
A synthetic data generator for text recognition
SGLang is a fast serving framework for large language models and vision language models.
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Official PyTorch implementation of the paper "DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training".
Some useful custom Hive UDFs, especially array, JSON, math, and string functions.
Official PyTorch implementation of "VidToMe: Video Token Merging for Zero-Shot Video Editing" (CVPR 2024)
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
Using tabular and deep reinforcement learning methods to infer optimal market-making strategies
Ongoing research training transformer models at scale