qiulu66

Follow

qiulu66

Follow

6 followers · 14 following

Achievements

Achievements

Stars

KlingTeam / VANS

Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO

Python 60 Updated Nov 24, 2025

OpenHelix-Team / Awesome-VLA-RL

This repository summarizes recent advances in the VLA + RL paradigm and provides a taxonomic classification of relevant works.

343 4 Updated Oct 10, 2025

RLinf / RLinf

RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.

Python 1,471 137 Updated Nov 28, 2025

HKU-MMLab / OmniX

Official implementation of "OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes".

Python 80 2 Updated Nov 3, 2025

NVlabs / LongLive

LongLive: Real-time Interactive Long Video Generation

Python 838 54 Updated Nov 3, 2025

allenai / molmoact

Official Repository for MolmoAct

Python 260 27 Updated Oct 26, 2025

qiulu66 / Implicit-Video-Reasoning

Python 8 Updated Sep 14, 2025

rongyaofang / prism-bench

This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark"

Python 109 1 Updated Sep 12, 2025

TencentARC / AudioStory

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Jupyter Notebook 288 18 Updated Sep 21, 2025

KaiyueSun98 / T2I-ReasonBench

T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation

Jupyter Notebook 32 3 Updated Sep 16, 2025

TencentARC / ARC-Hunyuan-Video-7B

Structured Video Comprehension of Real-World Shorts

Python 219 8 Updated Sep 21, 2025

InternRobotics / OST-Bench

[NeurIPS 2025] OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

Python 67 1 Updated Sep 29, 2025

InternRobotics / StreamVLN

Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"

Python 317 17 Updated Nov 2, 2025

TencentARC / GRPO-CARE

Python 79 2 Updated Jun 23, 2025

Yukun-Huang / DreamCube

[ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".

Python 158 11 Updated Nov 5, 2025

TencentARC / TokLIP

TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

Python 231 5 Updated Aug 18, 2025

qiulu66 / Anime-Shooter

Python 44 1 Updated Jun 4, 2025

TencentARC / Video-Holmes

Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?

Python 78 1 Updated Jul 13, 2025

KaiyueSun98 / T2I-Personalization-with-AR

47 1 Updated Apr 20, 2025

SilentView / GigaTok

[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"

Python 194 1 Updated Jun 26, 2025

VAST-AI-Research / HoloPart

HoloPart: Generative 3D Part Amodal Segmentation

Python 597 40 Updated Apr 11, 2025

TencentARC / AnimeGamer

[ICCV 2025] AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Python 338 29 Updated Apr 9, 2025

TencentARC / SEED-Bench-R1

Python 94 2 Updated Jun 23, 2025

YuqingWang1029 / TokenBridge

[ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/TokenBridge

Python 149 4 Updated Jul 24, 2025

KlingTeam / GameFactory

[ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos

Python 448 16 Updated Mar 22, 2025

YuqingWang1029 / PAR

[CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project

Python 180 2 Updated Mar 20, 2025

Karine-Huang / T2I-CompBench

[Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation

Python 317 16 Updated Sep 2, 2025

KaiyueSun98 / T2V-CompBench

[CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation

Jupyter Notebook 99 6 Updated Oct 25, 2025

yhyang-myron / DreamComposer

[CVPR 2024] DreamComposer: Controllable 3D Object Generation via Multi-View Conditions

Python 134 3 Updated Jul 22, 2024

Pointcept / SAMPart3D

SAMPart3D: Segment Any Part in 3D Objects

Python 500 28 Updated May 4, 2025