  • Tsinghua University
  • Beijing


slime is an LLM post-training framework for RL Scaling.

Python 2,332 237 Updated Nov 2, 2025

Ongoing research training transformer models at scale

Python 14,048 3,222 Updated Nov 2, 2025

My learning notes/codes for ML SYS.

Python 4,036 244 Updated Oct 6, 2025

Nano vLLM

Python 7,522 957 Updated Aug 31, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,012 2,405 Updated Nov 1, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 5,792 684 Updated Nov 2, 2025

Collect every awesome work about r1!

Python 422 15 Updated May 2, 2025

Triton Documentation in Chinese Simplified / Triton 中文文档

TypeScript 89 9 Updated Apr 15, 2025

Solve puzzles. Learn CUDA.

Jupyter Notebook 11,599 888 Updated Sep 1, 2024

A PyTorch native platform for training generative AI models

Python 4,631 590 Updated Nov 2, 2025

MS-Agent: Lightweight Framework for Empowering Agents with Autonomous Exploration in Complex Task Scenarios

Python 3,544 404 Updated Nov 1, 2025

PyTorch distributed training acceleration framework

Python 53 9 Updated Aug 13, 2025

flash attention tutorial written in python, triton, cuda, cutlass

Cuda 441 47 Updated May 14, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,802 937 Updated Nov 2, 2025

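This project shares the technical principles behind large models along with hands-on experience (large-model engineering and real-world application deployment).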

HTML 21,656 2,536 Updated Oct 19, 2025

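AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.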

Jupyter Notebook 15,467 2,223 Updated Sep 3, 2025

How to optimize some algorithms in CUDA.

Cuda 2,593 234 Updated Oct 30, 2025

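fastllm is a high-performance LLM inference library with no backend dependencies. It supports both tensor-parallel inference for dense models and hybrid-mode inference for MoE models; any GPU with 10 GB or more of memory can run the full DeepSeek model. A dual-socket 9004/9005 server with a single GPU can deploy the original full-size, full-precision DeepSeek model at 20 tps with a single concurrent request; the INT4-quantized model reaches 30 tps with a single concurrent request and 60+ tps with multiple concurrent requests.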

C++ 4,059 411 Updated Oct 28, 2025

LLM deployment project based on MNN. This project has been merged into MNN.

C++ 1,607 176 Updated Jan 20, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,226 613 Updated Nov 1, 2025

Universal LLM Deployment Engine with ML Compilation

Python 21,553 1,846 Updated Oct 28, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 151,966 31,020 Updated Nov 2, 2025

Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

Python 36,394 6,049 Updated Oct 30, 2025

MLJ: Libsignal: an open library for traffic signal control

Python 152 24 Updated Sep 19, 2025

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Python 3,163 456 Updated Oct 28, 2025

A toolkit for developing and comparing reinforcement learning algorithms.

Python 36,735 8,713 Updated Oct 11, 2024

Reinforcement Learning environments for Traffic Signal Control with SUMO. Compatible with Gymnasium, PettingZoo, and popular RL libraries.

Python 933 239 Updated Oct 18, 2025

Elegant and powerful theme for Hexo.

Stylus 8,292 2,036 Updated Jun 27, 2024

Awesome Deep Learning papers for industrial Search, Recommendation and Advertisement. They focus on Embedding, Matching, Pre-Ranking, Ranking (CTR/CVR prediction), Post Ranking, Relevance, LLM, Rei…

Python 2,171 275 Updated Oct 18, 2025