Skip to content
View ZhengYinan-AIR's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report ZhengYinan-AIR

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for paper "SPG Sandwiched Policy Gradient for Masked Diffusion Language Models"

Python 46 4 Updated Oct 29, 2025

A optimized PyTorch framework for behavior cloning with flow related generative models.

Python 171 4 Updated Dec 22, 2025

TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

Python 377 30 Updated Dec 16, 2025

Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"

Python 391 49 Updated Dec 20, 2025

VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos

Python 222 9 Updated Dec 23, 2025

Official implementation for DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)

Python 127 20 Updated Aug 5, 2025
Python 290 50 Updated Dec 5, 2025

The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"

Jupyter Notebook 130 3 Updated Dec 22, 2025

[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding

Python 493 10 Updated Nov 14, 2025

Open-source unified multimodal model

Python 5,521 484 Updated Oct 27, 2025

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,600 781 Updated Dec 21, 2025

MiMo-Embodied

Python 336 12 Updated Nov 21, 2025

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

Python 844 58 Updated Nov 3, 2025

Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model

Python 918 57 Updated Dec 27, 2025

Devkit and documentation for the NVIDIA Physical AI Autonomous Vehicles Dataset

Python 217 13 Updated Nov 29, 2025
Python 345 45 Updated Mar 24, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,096 1,281 Updated Oct 11, 2025
Python 259 22 Updated Dec 17, 2025

RynnVLA-002: A Unified Vision-Language-Action and World Model

Python 811 47 Updated Dec 2, 2025

Native Multimodal Models are World Learners

Python 1,383 52 Updated Nov 28, 2025

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 4,287 361 Updated Nov 27, 2025

[NeurIPS 2025] The official implementation of "Towards Robust Zero-Shot Reinforcement Learning"

Python 9 4 Updated Dec 10, 2025

The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"

C++ 392 29 Updated Dec 26, 2025

[NeurIPS 2025] Official implementation for "Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling"

Python 120 18 Updated Nov 27, 2025

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python 523 18 Updated Sep 22, 2025
7 Updated Nov 10, 2025

Dream 7B, a large diffusion language model

Python 1,127 72 Updated Nov 21, 2025
21 Updated Sep 26, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,122 670 Updated Nov 20, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,492 2,000 Updated Nov 1, 2025
Next