A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

1,923 81 Updated Nov 8, 2025

zhangchen98 / Awesome-Aerial-VLN

6 Updated Oct 24, 2025

MLNLP-World / Paper-Writing-Tips

MLNLP社区用来帮助大家避免论文投稿小错误的整理仓库。 Paper Writing Tips

4,168 509 Updated May 29, 2022

facebookresearch / map-anything

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 2,216 124 Updated Oct 24, 2025

unitreerobotics / unifolm-world-model-action

Python 687 62 Updated Oct 1, 2025

X-Square-Robot / wall-x

Building General-Purpose Robots Based on Embodied Foundation Model

Python 587 37 Updated Nov 7, 2025

Physical-Intelligence / openpi

Python 8,699 1,081 Updated Oct 19, 2025

lpiccinelli-eth / UniK3D

[CVPR 2025] UniK3D: Universal Camera Monocular 3D Estimation

Python 631 52 Updated Sep 14, 2025

zhangganlin / vista-slam

ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association

Python 153 4 Updated Nov 10, 2025

mystorm16 / FastVGGT

Code for FastVGGT: Training-Free Acceleration of Visual Geometry Transformer

Python 589 28 Updated Oct 14, 2025

InternRobotics / InternNav

InternRobotics' open platform for building generalized navigation foundation models.

Jupyter Notebook 384 38 Updated Nov 7, 2025

xiaomi-research / recogdrive

ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving

Python 336 26 Updated Nov 3, 2025

lcpmgh / colors

学术期刊配色推荐器

R 541 37 Updated Jan 27, 2025

utn-air / flownav

[IROS25] Combining Flow Matching and Depth Priors for Efficient Navigation

Python 18 2 Updated Oct 27, 2025

SkyworkAI / Matrix-Game

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python 1,716 176 Updated Oct 4, 2025

Xubo Luo LuoXubo

Highlights

Lists (23)

Attention mechanism

Autonomous driving

clip

Efficiency

ekf

Event Camera

Facial expression recognition

flow matching

Homography Estimation

IELTS

Image fusion

Image matching

Image retrieval

Lab homepage

Learning

Mulit sensor localization

NeRF

Paper codes

Pose estimation

Segmentation

SLAM with deep learning

Tracking

Visual localization

Starred repositories

image-matching

MATLAB

Linux

LaTeX

GitHub API

Git

Deep learning