Skip to content
View youyc22's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@New-Happiness-423B

Block or report youyc22

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Block Diffusion for Ultra-Fast Speculative Decoding

Python 344 13 Updated Jan 5, 2026

Multiplex Thinking

Python 31 2 Updated Jan 16, 2026

A collection of specialized agent skills for AI infrastructure development, enabling Claude Code to write, optimize, and debug high-performance systems.

Python 49 3 Updated Jan 14, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 2,845 169 Updated Jan 14, 2026

SGLang is a fast serving framework for large language models and vision language models.

Python 2 Updated Nov 25, 2025

清华大学云盘 (Tsinghua Cloud) 批量下载助手,适用于分享的文件 size 过大导致无法直接下载的情况,本脚本添加了更多实用的小功能

Python 226 27 Updated Oct 26, 2024

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Python 281 15 Updated Jan 9, 2026

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,954 2,690 Updated Dec 15, 2025

PyTorch building blocks for the OLMo ecosystem

Python 709 130 Updated Jan 19, 2026

Material for gpu-mode lectures

Jupyter Notebook 5,576 560 Updated Dec 8, 2025

General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.

Python 1,365 248 Updated Jan 15, 2026

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python 409 15 Updated Jul 11, 2025

Custom cache implementation to fix KV cache bug in ByteDance/Ouro-1.4B

Python 4 Updated Nov 12, 2025

LLMRouter: An Open-Source Library for LLM Routing

Python 1,156 95 Updated Jan 17, 2026

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Python 1,396 250 Updated Nov 29, 2023

Your Javis of Your Knowledge

Python 2 Updated Dec 28, 2025

[NeurIPS Spotlight 2025] Official implementation of the paper "Controlling Thinking Speed in Reasoning Models"

Python 6 Updated Dec 3, 2025

[arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Python 55 6 Updated Jan 4, 2026

[CanadianAI 2025] Code for paper "Intra-Layer Recurrence in Transformers for Language Modeling"

Python 6 Updated Aug 5, 2025

Block-Recurrent Dynamics in ViTs 🦖

Jupyter Notebook 23 Updated Dec 24, 2025

LLaDA2.0 is the diffusion language model series developed by InclusionAI team, Ant Group.

223 9 Updated Dec 19, 2025

Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling

Python 467 22 Updated May 17, 2025

Official implementation of "Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought" (NeurIPS 2025)

Jupyter Notebook 33 5 Updated Oct 8, 2025
Python 33 Updated Jan 13, 2026

📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.

339 7 Updated Nov 5, 2025

repo for paper https://arxiv.org/abs/2504.13837

Python 317 19 Updated Dec 17, 2025

Accelerating MoE with IO and Tile-aware Optimizations

Python 547 44 Updated Jan 14, 2026

A minimal yet professional single agent demo project that showcases the core execution pipeline and production-grade features of agents.

Python 1,225 168 Updated Jan 14, 2026

Your own data assistant

Python 1 Updated Dec 17, 2025
Next