Skip to content
View qiulu66's full-sized avatar

Block or report qiulu66

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO

Python 60 Updated Nov 24, 2025

This repository summarizes recent advances in the VLA + RL paradigm and provides a taxonomic classification of relevant works.

343 4 Updated Oct 10, 2025

RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.

Python 1,471 137 Updated Nov 28, 2025

Official implementation of "OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes".

Python 80 2 Updated Nov 3, 2025

LongLive: Real-time Interactive Long Video Generation

Python 838 54 Updated Nov 3, 2025

Official Repository for MolmoAct

Python 260 27 Updated Oct 26, 2025
Python 8 Updated Sep 14, 2025

This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark"

Python 109 1 Updated Sep 12, 2025

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Jupyter Notebook 288 18 Updated Sep 21, 2025

T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation

Jupyter Notebook 32 3 Updated Sep 16, 2025

Structured Video Comprehension of Real-World Shorts

Python 219 8 Updated Sep 21, 2025

[NeurIPS 2025] OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

Python 67 1 Updated Sep 29, 2025

Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"

Python 317 17 Updated Nov 2, 2025
Python 79 2 Updated Jun 23, 2025

[ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".

Python 158 11 Updated Nov 5, 2025

TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

Python 231 5 Updated Aug 18, 2025
Python 44 1 Updated Jun 4, 2025

Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?

Python 78 1 Updated Jul 13, 2025

[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"

Python 194 1 Updated Jun 26, 2025

HoloPart: Generative 3D Part Amodal Segmentation

Python 597 40 Updated Apr 11, 2025

[ICCV 2025] AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Python 338 29 Updated Apr 9, 2025
Python 94 2 Updated Jun 23, 2025

[ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/TokenBridge

Python 149 4 Updated Jul 24, 2025

[ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos

Python 448 16 Updated Mar 22, 2025

[CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project

Python 180 2 Updated Mar 20, 2025

[Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation

Python 317 16 Updated Sep 2, 2025

[CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation

Jupyter Notebook 99 6 Updated Oct 25, 2025

[CVPR 2024] DreamComposer: Controllable 3D Object Generation via Multi-View Conditions

Python 134 3 Updated Jul 22, 2024

SAMPart3D: Segment Any Part in 3D Objects

Python 500 28 Updated May 4, 2025
Next