Ligeng Zhu Lyken17

🎯

Focusing

Researcher at @NVIDIA on efficient large models. Previously @mit, @sfu, @zju.

1.1k followers · 570 following

Cambridge, MA
lzhu.me

Achievements

x3 x2 x4

Stars

zai-org / Open-AutoGLM

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 21,460 3,415 Updated Jan 5, 2026

model-architectures / GRAPE

Official implementation of GRAPE: Group Representational Position Encoding (https://arxiv.org/abs/2512.07805)

Python 70 4 Updated Jan 4, 2026

thu-nics / FrameFusion

[ICCV'25] The official code of paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"

Python 67 1 Updated Nov 24, 2025

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 22,232 4,007 Updated Jan 10, 2026

Princeton-AI2-Lab / DeepOCR

A reproduction of the Deepseek-OCR model including training

Python 201 18 Updated Nov 21, 2025

cambrian-mllm / cambrian-s

Cambrian-S: Towards Spatial Supersensing in Video

Python 470 17 Updated Dec 27, 2025

jax-ml / jax-llm-examples

Minimal yet performant LLM examples in pure JAX

Python 226 29 Updated Jan 3, 2026

sdan / vlm-gym

RL gym for vision language models written in JAX

Python 140 12 Updated Oct 30, 2025

MoonshotAI / Kimi-Linear

1,259 57 Updated Nov 17, 2025

ISEEKYAN / verl_megatron_practice

(best/better) practices of megatron on veRL and tuning guide

Shell 116 8 Updated Sep 26, 2025

RLsys-Foundation / TritonForge

🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation feedback, cross-platform NVIDIA/AMD, Kernelbook + KernelBench

Python 112 2 Updated Nov 10, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 3,259 408 Updated Jan 9, 2026

Blaizzy / mlx-vlm

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

Python 1,987 230 Updated Jan 6, 2026

RLinf / RLinf

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 2,065 208 Updated Jan 10, 2026

facebookresearch / perception_models

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,065 139 Updated Dec 18, 2025

yaof20 / Flash-RL

Implementation of FP8/INT8 rollout for RL training without performance drop.

Python 281 19 Updated Nov 7, 2025

yfzhang114 / Awesome-Multimodal-Large-Language-Models

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

857 35 Updated Dec 4, 2025

bytedance / UI-TARS

Pioneering Automated GUI Interaction with Native Agents

Python 8,850 629 Updated Jan 8, 2026

ServiceNow / geo-bench

GEO-Bench: Toward Foundation Models for Earth Monitoring

Python 170 14 Updated Jul 16, 2025

Efficient-Large-Model / flash-attention-builder

Build Flash Attention wheels for NVIDIA clusters

1 Updated Aug 24, 2025

KellerJordan / Muon

Muon is an optimizer for hidden layers in neural networks

Python 2,181 105 Updated Nov 23, 2025

HITsz-TMG / Awesome-Large-Multimodal-Reasoning-Models

The development and future prospects of large multimodal reasoning models.

570 20 Updated Jan 9, 2026

SkyworkAI / Skywork-OR1

Unleashing the Power of Reinforcement Learning for Math and Code Reasoners

Python 738 44 Updated Jun 6, 2025

dhcode-cpp / X-R1

minimal-cost for training 0.5B R1-Zero

Python 799 102 Updated May 14, 2025

DCDmllm / AnyEdit

【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"

Jupyter Notebook 210 6 Updated Apr 5, 2025

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,748 272 Updated Jul 18, 2025

MiniMax-AI / One-RL-to-See-Them-All

The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning

Python 330 18 Updated May 31, 2025

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Python 4,550 635 Updated Jan 10, 2026

Planetable / Planet

Build and host decentralized blogs and websites on your Mac

Swift 1,723 69 Updated Dec 30, 2025

open-thought / reasoning-gym

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,301 108 Updated Dec 15, 2025
