Skip to content
View feitianxue's full-sized avatar

Block or report feitianxue

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,144 311 Updated Nov 27, 2025

搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)

HTML 4,049 448 Updated Nov 27, 2025

A unified architecture deep learning framework designed specifically for ultra-large-scale sparse models.

Python 260 11 Updated Nov 19, 2025

HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of HierarchicalKV is to store key-value feature-embeddings on h…

Cuda 181 31 Updated Nov 2, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,223 757 Updated Nov 27, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,983 1,164 Updated Nov 27, 2025

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 498 112 Updated Nov 27, 2025

Efficient and easy multi-instance LLM serving

Python 511 43 Updated Sep 3, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 11,294 1,006 Updated Nov 27, 2025

My learning notes/codes for ML SYS.

Python 4,288 259 Updated Nov 25, 2025

Universal memory layer for AI Agents

Python 43,621 4,718 Updated Nov 27, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,393 162 Updated Nov 27, 2025

Train your Agent model via our easy and efficient framework

Python 1,632 155 Updated Nov 17, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,090 238 Updated Nov 28, 2025

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Python 325 20 Updated Apr 24, 2025

An easy-to-use framework for large scale recommendation algorithms.

Python 271 52 Updated Nov 27, 2025

HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling

Python 548 72 Updated Aug 26, 2025

Pytorch domain library for recommendation systems

Python 2,402 580 Updated Nov 26, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,474 820 Updated Nov 9, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,415 162 Updated Mar 20, 2025

Efficient Triton Kernels for LLM Training

Python 5,884 438 Updated Nov 28, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 16,759 2,667 Updated Nov 27, 2025

Train transformer language models with reinforcement learning.

Python 16,450 2,321 Updated Nov 27, 2025

Scalable toolkit for efficient model alignment

Python 847 103 Updated Oct 6, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 63,195 7,642 Updated Nov 27, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,764 1,006 Updated Nov 25, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,885 905 Updated Sep 30, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 20,456 3,537 Updated Nov 28, 2025

Let your Claude able to think

TypeScript 16,551 1,959 Updated Nov 4, 2025
Next