- Worked at Kuaishou, Baidu, Meituan
- Beijing
- https://ageliss.github.io/gqjiang/
- AReaL Public
Forked from inclusionAI/AReaL
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Python · Apache License 2.0 · Updated Nov 14, 2025
- slime Public
Forked from THUDM/slime
slime is an LLM post-training framework for RL Scaling.
Python · Apache License 2.0 · Updated Nov 14, 2025
- dots.ocr Public
Forked from rednote-hilab/dots.ocr
Multilingual Document Layout Parsing in a Single Vision-Language Model
Python · MIT License · Updated Oct 11, 2025
- VeOmni Public
Forked from ByteDance-Seed/VeOmni
VeOmni: Scaling any Modality Model Training to any Accelerators with PyTorch native Training Framework
Python · Apache License 2.0 · Updated Aug 15, 2025
- mbridge Public
Forked from ISEEKYAN/mbridge
Bridge Megatron-Core to Hugging Face/Reinforcement Learning
Python · Other · Updated Aug 8, 2025
- SpecForge Public
Forked from sgl-project/SpecForge
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
Python · MIT License · Updated Aug 1, 2025
- sglang Public
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python · Apache License 2.0 · Updated Jul 23, 2025
- TensorRT-Model-Optimizer Public
Forked from NVIDIA/TensorRT-Model-Optimizer
A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment…
Python · Other · Updated Jun 30, 2025
- OpenRLHF Public
Forked from OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)
Python · Apache License 2.0 · Updated May 23, 2025
- NeMo-RL Public
Forked from NVIDIA-NeMo/RL
Scalable toolkit for efficient model reinforcement
Python · Apache License 2.0 · Updated May 22, 2025
- Awesome-LLM-Inference Public
Forked from xlite-dev/Awesome-LLM-Inference
📚 A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism etc.
Python · GNU General Public License v3.0 · Updated Apr 17, 2025
- verl Public
Forked from volcengine/verl
verl: Volcano Engine Reinforcement Learning for LLMs
Python · Apache License 2.0 · Updated Apr 16, 2025
- Awesome-LLM-Compression Public
Forked from HuangOwen/Awesome-LLM-Compression
Awesome LLM compression research papers and tools.
MIT License · Updated Dec 24, 2024
- llm_interview_note Public
Forked from wdndev/llm_interview_note
Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers.
HTML · Updated Oct 22, 2024
- vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python · Apache License 2.0 · Updated Aug 7, 2024
- SpeculativeDecodingPapers Public
Forked from hemingkx/SpeculativeDecodingPapers
📰 Must-read papers and blogs on Speculative Decoding ⚡️
Apache License 2.0 · Updated Jul 24, 2024
- lmdeploy Public
Forked from InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Python · Apache License 2.0 · Updated Jun 25, 2024
- EAGLE Public
Forked from SafeAILab/EAGLE
[ICML'24] EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty
Python · Apache License 2.0 · Updated May 26, 2024
- llama.cpp Public
Forked from ggml-org/llama.cpp
LLM inference in C/C++
C++ · MIT License · Updated Mar 5, 2024
- Medusa Public
Forked from FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Jupyter Notebook · Apache License 2.0 · Updated Feb 27, 2024
- NeMo Public
Forked from NVIDIA-NeMo/NeMo
NeMo: a framework for generative AI
Python · Apache License 2.0 · Updated Feb 17, 2024
- rtp-llm Public
Forked from alibaba/rtp-llm
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
C++ · Apache License 2.0 · Updated Feb 5, 2024
- LAVIS Public
Forked from salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Jupyter Notebook · BSD 3-Clause "New" or "Revised" License · Updated Jan 31, 2024
- CLIP Public
Forked from openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Jupyter Notebook · MIT License · Updated Jan 11, 2024
- trlx Public
Forked from CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Python · MIT License · Updated Jan 8, 2024
- peft Public
Forked from huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Python · Apache License 2.0 · Updated Nov 3, 2023
- MS-AMP Public
Forked from Azure/MS-AMP
Microsoft Automatic Mixed Precision Library
Python · MIT License · Updated Oct 30, 2023
- TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ · Apache License 2.0 · Updated Oct 30, 2023
- TransformerEngine Public
Forked from NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
Python · Apache License 2.0 · Updated Oct 17, 2023