Skip to content
View XuDongHecs's full-sized avatar

Block or report XuDongHecs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for FastVGGT: Training-Free Acceleration of Visual Geometry Transformer

Python 589 28 Updated Oct 14, 2025

Code for Streaming 4D Visual Geometry Transformer

Python 703 29 Updated Oct 27, 2025

[NeurIPS 2025] Flow x RL. "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning". Support VLAs e.g., pi0, pi0.5. Fully open-sourced.

Python 158 13 Updated Nov 7, 2025

This repository is dedicated to collecting and sharing research papers on diffusion guidance methods.

60 2 Updated Oct 13, 2025

[ICCV 2025 Oral] MVTracker: Multi-view 3D Point Tracking

Python 418 17 Updated Nov 3, 2025

Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images

Python 94 4 Updated Sep 3, 2025

[ICRA 2025] Interactive4D: Interactive 4D LiDAR Segmentation

Python 96 6 Updated May 7, 2025

Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference

Python 194 14 Updated Sep 25, 2025

siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems

Python 224 20 Updated Nov 10, 2025

HEDNet (NeurIPS 2023) & SAFDNet (CVPR 2024 Oral)

Python 180 15 Updated Sep 28, 2024

[ICLR 2024] Map Learning with Lane Segment for Autonomous Driving

Python 348 41 Updated Jul 2, 2025

Official Code for Epona: Autoregressive Diffusion World Model for Autonomous Driving (ICCV 2025)

Python 231 17 Updated Jul 22, 2025

Official Code for "MITracker: Multi-View Integration for Visual Object Tracking"

Python 111 4 Updated Jun 18, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 11,591 1,194 Updated Oct 11, 2025

HE-Drive: Human-Like End-to-End Driving with Vision Language Models

Python 247 16 Updated Aug 17, 2025

Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)

Python 652 23 Updated Sep 24, 2025

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,378 1,364 Updated Jul 9, 2025

Bridging Large Vision-Language Models and End-to-End Autonomous Driving

Python 495 35 Updated Dec 26, 2024
Cuda 340 48 Updated Jun 25, 2025

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Python 1,446 136 Updated Apr 26, 2025

Towards a Generative 3D World Engine for Embodied Intelligence

Python 332 18 Updated Nov 10, 2025

Code for "Cameras as Rays"

Python 605 27 Updated May 31, 2024

Large Driving Models

246 11 Updated Jan 27, 2025

基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择ChatGPT/Claude/DeepSeek/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。

Python 39,656 9,481 Updated Oct 22, 2025

[CVPR 2025] VideoWorld is a simple generative model that learns purely from unlabeled videos—much like how babies learn by observing their environment.

Python 642 32 Updated Jul 25, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,666 2,119 Updated Jul 17, 2025

[ICCV 2025] Self-Calibrating Gaussian Splatting for Large Field-of-View Reconstruction

Python 118 12 Updated Feb 14, 2025

Stable Diffusion web UI

Python 158,106 29,346 Updated Nov 7, 2025

A general fine-tuning kit geared toward diffusion models.

Python 2,598 253 Updated Nov 10, 2025

Open source software that helps you create and deploy high-frequency crypto trading bots

Python 15,049 4,106 Updated Nov 10, 2025
Next