Skip to content
View Karine-Huang's full-sized avatar

Highlights

  • Pro

Organizations

@LIU-Vision-Group

Block or report Karine-Huang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 3,413 242 Updated Oct 17, 2025

Official implementation of "OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes".

Python 67 1 Updated Nov 3, 2025

[NeurIPS'25] Official repository of Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Python 415 15 Updated Nov 10, 2025

Fast and Universal 3D reconstruction model for versatile tasks

Python 770 53 Updated Nov 11, 2025

Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Python 421 59 Updated Nov 8, 2025

Contexts Optical Compression

Python 20,463 1,734 Updated Oct 25, 2025

Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images

Python 34 2 Updated Nov 4, 2025

[SIGGRAPH Asia 2025] OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Python 146 7 Updated Nov 6, 2025

LongLive: Real-time Interactive Long Video Generation

Python 818 52 Updated Nov 3, 2025

Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation

Python 162 8 Updated Jul 25, 2025

ViPE: Video Pose Engine for Geometric 3D Perception

Python 1,507 116 Updated Oct 13, 2025

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 984 48 Updated Oct 13, 2025

This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark"

Python 107 1 Updated Sep 12, 2025

🌐 3D and 4D World Modeling: A Survey

HTML 636 36 Updated Oct 3, 2025

[Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning

Python 148 3 Updated Sep 15, 2025

Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.

Python 1,339 128 Updated Oct 22, 2025

Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"

Python 279 13 Updated Apr 23, 2025

[ICCV 2025 Oral] MVTracker: Multi-view 3D Point Tracking

Python 420 17 Updated Nov 3, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,834 201 Updated Sep 12, 2025

T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation

Jupyter Notebook 31 3 Updated Sep 16, 2025

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python 1,731 177 Updated Oct 4, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,293 570 Updated Nov 3, 2025

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Python 616 68 Updated Oct 16, 2025

Pusa: Thousands Timesteps Video Diffusion Model

Python 661 47 Updated Sep 5, 2025

The absolute trainer to light up AI agents.

Python 8,322 655 Updated Nov 15, 2025

Official repository of In-Context LoRA for Diffusion Transformers

2,028 95 Updated Dec 20, 2024

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

Python 2,424 203 Updated Oct 22, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 11,722 1,318 Updated Nov 14, 2025

Lets make video diffusion practical!

Python 16,177 1,557 Updated Oct 16, 2025

Code for Streaming 4D Visual Geometry Transformer

Python 712 29 Updated Oct 27, 2025
Next