A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,012 84 Updated Nov 21, 2025

wzcai99 / Pixel-Navigator

Official GitHub Repository for Paper "Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill", ICRA 2024

Python 123 9 Updated Oct 30, 2024

B0B8K1ng / WMNavigation

[IROS'25 Oral] WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation

Python 131 2 Updated Oct 24, 2025

xiexiexiaoxiexie / Intelligent-LiDAR-Navigation-LLM-as-Copilot

Python 16 1 Updated Mar 23, 2025

MarSaKi / ETPNav

[TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"

Python 389 33 Updated Apr 5, 2025

ybgdgh / Co-NavGPT

We proposed to explore and search for the target in unknown environment based on Large Language Model for multi-robot system.

Python 91 3 Updated Jun 30, 2024

bagh2178 / UniGoal

[CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation

Python 268 11 Updated Sep 16, 2025

ybgdgh / L3MVN

Leveraging Large Language Models for Visual Target Navigation

Python 141 22 Updated Oct 24, 2023

bagh2178 / SG-Nav

[NeurIPS 2024] SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation

Jupyter Notebook 287 22 Updated Sep 16, 2025

naokiyokoyama / ovon

Open Vocabulary Object Navigation

Python 101 12 Updated May 15, 2025

concept-graphs / concept-graphs

Official code release for ConceptGraphs

Python 720 101 Updated Oct 16, 2025

LYX0501 / InstructNav

Python 179 10 Updated Mar 29, 2025

bdaiinstitute / vlfm

The repository provides code associated with the paper VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation (ICRA 2024)

Python 618 72 Updated Nov 12, 2025

DefaultRui / VLN-VER

[CVPR24] Volumetric Environment Representation for Vision-Language Navigation

Python 128 9 Updated Sep 9, 2024

zli12321 / Vision-Language-Models-Overview

A most Frontend Collection and survey of vision-language model papers, and models GitHub repository. Continuous updates.

455 25 Updated Oct 31, 2025

gokayfem / awesome-vlm-architectures

Famous Vision Language Models and Their Architectures

Markdown 1,090 50 Updated Feb 24, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 16,424 2,313 Updated Nov 26, 2025

xinyu1205 / recognize-anything

Open-source and strong foundation image recognition models.

Jupyter Notebook 3,491 314 Updated Feb 18, 2025

BAAI-DCAI / SpatialBot

The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.

Python 318 21 Updated Sep 14, 2025

hojonathanho / diffusion

Denoising Diffusion Probabilistic Models

Python 4,861 455 Updated Aug 29, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nixwang

Achievements

Achievements

Block or report nixwang

Starred repositories

UnknownFreeOccupied / ufomap

LTU-RAI / ExplorationRRT

DDDyyhhh / -Fast-LIO-OctoMap-3D-SLAM-

Taeyoung96 / FAST_LIO_ROS2

Taeyoung96 / OctoMap-ROS2

BohemianRhapsodyz / semantic_exploration

Exiam6 / ViTaL

marmotlab / ViPER

HITSZ-NRSL / RCPCC

aiming-lab / GRAPE

jonyzhang2023 / awesome-embodied-vla-va-vln