Tsinghua University
Beijing, China (UTC +08:00)
yi2100237651[AT]outlook.com
Lists (21)
attention design
awesome-agent
awesome-diffusion
awesome-image-generation
awesome-llm
deep-research-agent
dllm
dynn
efficient reasoning
hhh
inference-engine
latent-reasoning
learning
mlsys
model architecture
reasoning
RL
test-time-scaling
training-infra
training-scheme
vla
Stars
Block Diffusion for Ultra-Fast Speculative Decoding
A collection of specialized agent skills for AI infrastructure development, enabling Claude Code to write, optimize, and debug high-performance systems.
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
cao1zhg / sglang
Forked from sgl-project/sglang. SGLang is a fast serving framework for large language models and vision language models.
Batch download helper for 清华大学云盘 (Tsinghua Cloud), for cases where a shared file is too large to download directly; the script also adds several handy extra features.
Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
PyTorch building blocks for the OLMo ecosystem
General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
Custom cache implementation to fix a KV cache bug in ByteDance/Ouro-1.4B
LLMRouter: An Open-Source Library for LLM Routing
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
[NeurIPS Spotlight 2025] Official implementation of the paper "Controlling Thinking Speed in Reasoning Models"
[arXiv:2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
[CanadianAI 2025] Code for paper "Intra-Layer Recurrence in Transformers for Language Modeling"
Block-Recurrent Dynamics in ViTs 🦖
LLaDA2.0 is the diffusion language model series developed by the InclusionAI team at Ant Group.
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
Official implementation of "Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought" (NeurIPS 2025)
📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.
Repository for the paper https://arxiv.org/abs/2504.13837
Accelerating MoE with IO and Tile-aware Optimizations
A minimal yet professional single-agent demo project that showcases the core execution pipeline and production-grade features of agents.