Native Multimodal Models are World Learners

Python 1,220 42 Updated Nov 7, 2025

[ArXiv 25] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling

Python 515 29 Updated Nov 1, 2025

LongLive: Real-time Interactive Long Video Generation

Python 814 51 Updated Nov 3, 2025

Code release for paper "Test-Time Training Done Right"

Python 317 15 Updated Sep 8, 2025
C++ 26 Updated Oct 31, 2025

This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark"

Python 106 1 Updated Sep 12, 2025

Official implementation of the paper "GenCompositor: Generative Video Compositing with Diffusion Transformer"

Python 128 4 Updated Oct 5, 2025

T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation

Jupyter Notebook 31 3 Updated Sep 16, 2025

Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer

Python 21 2 Updated Nov 4, 2025

[NeurIPS 2025] VIKI‑R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning

Python 56 Updated Oct 20, 2025
Python 77 2 Updated Jun 23, 2025

[ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

Python 390 15 Updated Jul 25, 2025

[ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".

Python 154 11 Updated Nov 5, 2025

[ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control

Python 82 Updated Jul 4, 2025

An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search

Python 99 5 Updated Oct 3, 2025

GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning

Python 101 3 Updated May 29, 2025

Open-source unified multimodal model

Python 5,278 455 Updated Oct 27, 2025

[SIGGRAPH Asia 2025] DreamO: A Unified Framework for Image Customization

Python 1,719 129 Updated Aug 14, 2025

SkyReels-V2: Infinite-length Film Generative model

Python 4,938 707 Updated Aug 11, 2025

[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"

Python 192 1 Updated Jun 26, 2025

MineWorld: A Real-time interactive world model on Minecraft

Python 408 32 Updated Aug 6, 2025

HoloPart: Generative 3D Part Amodal Segmentation

Python 592 39 Updated Apr 11, 2025
Python 93 2 Updated Jun 23, 2025

[ICCV 2025] RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints

Python 92 7 Updated Sep 2, 2025

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,035 55 Updated Aug 7, 2025

[ICLR 2025] Autoregressive Video Generation without Vector Quantization

Python 593 19 Updated Oct 29, 2025

[ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos

Python 437 16 Updated Mar 22, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,068 518 Updated Jun 9, 2025