Stars
Userspace eBPF runtime for Observability, Network, GPU & General Extensions Framework
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
The absolute trainer to light up AI agents.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / veRL/ Swift / Ultra…
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
This repo is used for archiving my notes, codes and materials of cs learning.
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
AndroidWorld is an environment and benchmark for autonomous agents
Democratizing Reinforcement Learning for LLMs
Distributed Compiler based on Triton for Parallel Systems
A multi-cluster pod deletion protection webhook with high scalability and disaster tolerance
My learning notes/codes for ML SYS.
Android in docker solution with noVNC supported and video recording
A high-throughput and memory-efficient inference and serving engine for LLMs
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
DeepEP: an efficient expert-parallel communication library
verl: Volcano Engine Reinforcement Learning for LLMs
Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training