Lists (32)
Sort Name ascending (A-Z)
Books
Reading books and programming booksCloud
Cloud Computing relatedCompile
Compilers and toolscuda
cuda related projectsData Analysis
Data Structure and Algorihms
Data Structure and AlgorihmsDatabase
Database realtedDeep Learning
Deep Learning books, courses, projects, models and algorithmsDistributed
Distributed SystemsDiT
Documentary
Document relatedemployment
game
Gaming
Gaming relatedgeo-distributed
Guidance
Guidance courses/books/docsHardwares
Hardwares realtedinterview
interview relatedladders
I can not say anything about itllm
large language modelsLLM Inference
machine learning
Machine Learning models, algorithmsMLSys
Machine Learning Systemsnetwork
Programming Languages
About Programming LanguagesRDMA
Softwares
Good Softwaressystem
systems and related toolstools
Virtual Technology
Virtual Technology Relatedweb
Web Frontend
Web Frontend widgets, frameworksStarred repositories
Perplexity open source garden for inference technology
support Multiple Producer and Multiple Consumer with lock-free queue
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
A library of reinforcement learning components and agents
This repository organizes materials, recordings, and schedules related to AI-infra learning meetings.
BurstEngine is an efficient framework designed to train LLMs on long-sequence data.
flash attention tutorial written in python, triton, cuda, cutlass
NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer
A high-performance inference engine for LLMs, optimized for diverse AI accelerators.
RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.
Distributed Compiler based on Triton for Parallel Systems
Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
verl: Volcano Engine Reinforcement Learning for LLMs
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.
Bridge Megatron-Core to Hugging Face/Reinforcement Learning
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】
GPU Stress Test is a tool to stress the compute engine of NVIDIA Tesla GPU’s by running a BLAS matrix multiply using different data types. It can be compiled and run on both Linux and Windows.
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.
Learning Large Language Model (LLM)(大语言模型学习)