Stars
Code for FastVGGT: Training-Free Acceleration of Visual Geometry Transformer
Code for Streaming 4D Visual Geometry Transformer
[NeurIPS 2025] Flow x RL. "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning". Supports VLAs, e.g., pi0 and pi0.5. Fully open-sourced.
This repository is dedicated to collecting and sharing research papers on diffusion guidance methods.
[ICCV 2025 Oral] MVTracker: Multi-view 3D Point Tracking
Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images
[ICRA 2025] Interactive4D: Interactive 4D LiDAR Segmentation
Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference
siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems
HEDNet (NeurIPS 2023) & SAFDNet (CVPR 2024 Oral)
[ICLR 2024] Map Learning with Lane Segment for Autonomous Driving
Official Code for Epona: Autoregressive Diffusion World Model for Autonomous Driving (ICCV 2025)
Official Code for "MITracker: Multi-View Integration for Visual Object Tracking"
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
HE-Drive: Human-Like End-to-End Driving with Vision Language Models
Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)
Bridging Large Vision-Language Models and End-to-End Autonomous Driving
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
Towards a Generative 3D World Engine for Embodied Intelligence
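A chatbot built on large language models that supports integration with WeChat Official Accounts, WeCom (Enterprise WeChat) apps, Feishu, DingTalk, and other platforms. It can use ChatGPT/Claude/DeepSeek/ERNIE Bot/iFlytek Spark/Tongyi Qianwen/Gemini/GLM-4/Kimi/LinkAI, handles text, voice, and images, can access the operating system and the internet, and supports building customized enterprise customer-service assistants on top of your own knowledge base.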
[CVPR 2025] VideoWorld is a simple generative model that learns purely from unlabeled videos, much like how babies learn by observing their environment.
Wan: Open and Advanced Large-Scale Video Generative Models
[ICCV 2025] Self-Calibrating Gaussian Splatting for Large Field-of-View Reconstruction
Stable Diffusion web UI
A general fine-tuning kit geared toward diffusion models.
Open source software that helps you create and deploy high-frequency crypto trading bots