Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,802 937 Updated Nov 2, 2025

liguodongiot / llm-action

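This project aims to share the technical principles and hands-on experience behind large models (large-model engineering and deploying large-model applications).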

HTML 21,656 2,536 Updated Oct 19, 2025

Infrasys-AI / AISystem

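AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.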

Jupyter Notebook 15,467 2,223 Updated Sep 3, 2025

BBuf / how-to-optim-algorithm-in-cuda

How to optimize some algorithms in CUDA.

Cuda 2,593 234 Updated Oct 30, 2025

ztxz16 / fastllm

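fastllm is a high-performance LLM inference library with no backend dependencies. It supports both tensor-parallel inference for dense models and hybrid-mode inference for MoE models; any GPU with 10 GB or more of memory can run the full DeepSeek model. A dual-socket 9004/9005 server with a single GPU can deploy the original full-size, full-precision DeepSeek model at 20 tps for a single request; the INT4-quantized model reaches 30 tps for a single request and 60+ tps with concurrent requests.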

C++ 4,059 411 Updated Oct 28, 2025

wangzhaode / mnn-llm

LLM deployment project based on MNN. This project has been merged into MNN.

C++ 1,607 176 Updated Jan 20, 2025

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,226 613 Updated Nov 1, 2025

mlc-ai / mlc-llm

Universal LLM Deployment Engine with ML Compilation

Python 21,553 1,846 Updated Oct 28, 2025

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 151,966 31,020 Updated Nov 2, 2025

chatchat-space / Langchain-Chatchat

Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

Python 36,394 6,049 Updated Oct 30, 2025

DaRL-LibSignal / LibSignal

MLJ: Libsignal: an open library for traffic signal control

Python 152 24 Updated Sep 19, 2025

Farama-Foundation / PettingZoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Python 3,163 456 Updated Oct 28, 2025

openai / gym

A toolkit for developing and comparing reinforcement learning algorithms.

Python 36,735 8,713 Updated Oct 11, 2024

LucasAlegre / sumo-rl

Reinforcement Learning environments for Traffic Signal Control with SUMO. Compatible with Gymnasium, PettingZoo, and popular RL libraries.

Python 933 239 Updated Oct 18, 2025

theme-next / hexo-theme-next

Elegant and powerful theme for Hexo.

Stylus 8,292 2,036 Updated Jun 27, 2024

guyulongcs / Awesome-Deep-Learning-Papers-for-Search-Recommendation-Advertising

Awesome Deep Learning papers for industrial Search, Recommendation and Advertisement. They focus on Embedding, Matching, Pre-Ranking, Ranking (CTR/CVR prediction), Post Ranking, Relevance, LLM, Rei…

Python 2,171 275 Updated Oct 18, 2025

CityBrainChallenge / KDDCup2021-CityBrainChallenge-starter-kit

Python 77 40 Updated Aug 18, 2021


Zhikaiiii


Stars

THUDM / slime

NVIDIA / Megatron-LM

zhaochenyang20 / Awesome-ML-SYS-Tutorial

GeeeekExplorer / nano-vllm

volcengine / verl

LMCache / LMCache

modelscope / awesome-deep-reasoning

hyperai / triton-cn

srush / GPU-Puzzles

pytorch / torchtitan

modelscope / ms-agent

AlibabaPAI / torchacc

66RING / tiny-flash-attention

modelscope / ms-swift