- Institute for AI Industry Research (AIR), Tsinghua University
- https://scholar.google.com/citations?user=mHXjEbQAAAAJ&hl=en
- https://zhengyinan-air.github.io/
Stars
Code for the paper "SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models"
An optimized PyTorch framework for behavior cloning with flow-related generative models.
TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
Official implementation for DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)
The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework
Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model
Devkit and documentation for the NVIDIA Physical AI Autonomous Vehicles Dataset
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
RynnVLA-002: A Unified Vision-Language-Action and World Model
Native Multimodal Models are World Learners
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
[NeurIPS 2025] The official implementation of "Towards Robust Zero-Shot Reinforcement Learning"
The official implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"
[NeurIPS 2025] Official implementation for "Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling"
DiffusionNFT: Online Diffusion Reinforcement with Forward Process
Reference PyTorch implementation and models for DINOv3
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI