Skip to content
View ZebinX's full-sized avatar

Block or report ZebinX

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

1,863 78 Updated Oct 31, 2025

Implementation of [CVPR 2025] "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"

Python 866 82 Updated Feb 5, 2025

ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving

Python 320 25 Updated Oct 20, 2025

[CVPR2022] Remember Intentions: Retrospective-Memory-based Trajectory Prediction

Python 131 17 Updated Sep 11, 2022

HE-Drive: Human-Like End-to-End Driving with Vision Language Models

Python 247 16 Updated Aug 17, 2025
Python 206 18 Updated Oct 31, 2025

MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting zero-shot multi-speaker voice cloning, and long-form speech…

Python 1,002 87 Updated Oct 31, 2025

A curated list of awesome papers on Embodied AI and related research/industry-driven resources.

479 23 Updated Jun 3, 2025

Lumina Robotics Talent Call | Lumina社区具身智能招贤榜 | A list for Embodied AI / Robotics Jobs (PhD, RA, intern, full-time, etc

1,029 22 Updated Oct 27, 2025
Python 426 64 Updated Aug 28, 2021

Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io

Python 287 19 Updated May 17, 2025

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 1,061 57 Updated Apr 1, 2025

[ICLR 2025 Oral] The official implementation of "Diffusion-Based Planning for Autonomous Driving with Flexible Guidance"

Python 722 99 Updated Oct 28, 2025

[CVPR 2025 Highlight] Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving

Python 1,063 86 Updated Jun 17, 2025

Corruption and Perturbation Robustness (ICLR 2019)

Python 1,105 150 Updated Aug 24, 2022

[CoRL '25] Pseudo-Simulation for Autonomous Driving; [NeurIPS '24] NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking

Python 740 73 Updated Oct 27, 2025

Repo of "GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving"

Python 305 27 Updated Jul 14, 2025

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 1,974 148 Updated Mar 13, 2025