This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark"

Python 107 1 Updated Sep 12, 2025

worldbench / survey

🌐 3D and 4D World Modeling: A Survey

HTML 636 36 Updated Oct 3, 2025

TencentARC / IC-Custom

[Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning

Python 148 3 Updated Sep 15, 2025

Tencent-Hunyuan / HunyuanWorld-Voyager

Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.

Python 1,339 128 Updated Oct 22, 2025

showlab / FAR

Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"

Python 279 13 Updated Apr 23, 2025

ethz-vlg / mvtracker

[ICCV 2025 Oral] MVTracker: Multi-view 3D Point Tracking

Python 420 17 Updated Nov 3, 2025

guandeh17 / Self-Forcing

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,834 201 Updated Sep 12, 2025

KaiyueSun98 / T2I-ReasonBench

T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation

Jupyter Notebook 31 3 Updated Sep 16, 2025

SkyworkAI / Matrix-Game

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python 1,731 177 Updated Oct 4, 2025

facebookresearch / dinov3

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,293 570 Updated Nov 3, 2025

Tencent-Hunyuan / Hunyuan-GameCraft-1.0

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Python 616 68 Updated Oct 16, 2025

Yaofang-Liu / Pusa-VidGen

Pusa: Thousands Timesteps Video Diffusion Model

Python 661 47 Updated Sep 5, 2025

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 8,322 655 Updated Nov 15, 2025

ali-vilab / In-Context-LoRA

Official repository of In-Context LoRA for Diffusion Transformers

2,028 95 Updated Dec 20, 2024

Tencent-Hunyuan / HunyuanWorld-1.0

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

Python 2,424 203 Updated Oct 22, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 11,722 1,318 Updated Nov 14, 2025

lllyasviel / FramePack

Let's make video diffusion practical!

Python 16,177 1,557 Updated Oct 16, 2025

wzzheng / StreamVGGT

Code for Streaming 4D Visual Geometry Transformer

Python 712 29 Updated Oct 27, 2025

Karine_H Karine-Huang

Stars

ali-vilab / VACE

HKU-MMLab / OmniX

Pointcept / Concerto

Tencent-Hunyuan / HunyuanWorld-Mirror

yihao-meng / HoloCine

deepseek-ai / DeepSeek-OCR

HKU-MMLab / Math-VR-CodePlot-CoT

HKU-MMLab / OmniPart

NVlabs / LongLive

haoyi-duan / WorldScore

nv-tlabs / vipe

PRIME-RL / SimpleVLA-RL

rongyaofang / prism-bench