Skip to content
View llhe's full-sized avatar

Organizations

@XiaoMi

Block or report llhe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Minimal reproduction of DeepSeek R1-Zero

Python 12,594 1,543 Updated Apr 24, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,289 3,017 Updated Jan 13, 2026

Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.

Python 282 41 Updated Jan 13, 2026

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 1,806 98 Updated Jan 13, 2026

💫 Toolkit to help you get started with Spec-Driven Development

Python 62,138 5,395 Updated Dec 4, 2025

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 982 83 Updated Sep 4, 2024

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 24,852 3,484 Updated Jan 8, 2026

Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.

Python 310 59 Updated Jan 13, 2026

个人构建MoE大模型:从预训练到DPO的完整实践

Python 2,231 164 Updated Dec 30, 2025

The best ChatGPT that $100 can buy.

Python 40,223 5,182 Updated Jan 12, 2026

Nano vLLM

Python 10,726 1,376 Updated Nov 3, 2025

collection of benchmarks to measure basic GPU capabilities

C++ 479 73 Updated Oct 24, 2025

CRS-自建Claude Code镜像,一站式开源中转服务,让 Claude、OpenAI、Gemini、Droid 订阅统一接入,支持拼车共享,更高效分摊成本,原生工具无缝使用。

JavaScript 7,034 1,181 Updated Jan 13, 2026

Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.

TypeScript 25,455 1,995 Updated Jan 10, 2026

Lichtblick is an integrated visualization and diagnosis tool for robotics, available in your browser or as a desktop app on Linux, Windows, and macOS.

TypeScript 683 636 Updated Jan 13, 2026

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,838 293 Updated Jan 13, 2026

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 5,986 640 Updated Dec 27, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 37,160 4,412 Updated Jan 13, 2026

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,433 1,260 Updated Jan 12, 2026

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,720 312 Updated Nov 28, 2025

A PyTorch native platform for training generative AI models

Python 4,956 664 Updated Jan 13, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 82,876 12,461 Updated Jan 10, 2026

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,818 237 Updated Jan 8, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 8,775 848 Updated Jan 8, 2026

A framework for few-shot evaluation of language models.

Python 11,161 2,959 Updated Jan 7, 2026

Efficient Triton Kernels for LLM Training

Python 6,033 459 Updated Jan 13, 2026

A library for mechanistic interpretability of GPT-style language models

Python 2,973 493 Updated Jan 8, 2026

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python 50,624 4,175 Updated Jan 13, 2026

Building blocks for foundation models.

592 28 Updated Jan 3, 2024

Large Language Model Text Generation Inference

Python 10,727 1,251 Updated Jan 8, 2026
Next