Starred repositories
UFOMap: An Efficient Probabilistic 3D Mapping Framework That Embraces the Unknown
Taeyoung96 / FAST_LIO_ROS2
Forked from Ericsii/FAST_LIO_ROS2[ROS2 humble] ROS2 wrapper for FAST-LIO package
[ROS2 humble] Convert 3D LiDAR map to 2D Occupancy Grid Map
Active Semantic Mapping and Pose Graph Spectral Analysis for Robot Exploration
Accompanying codebase for paper"Touch begins where vision ends: Generalizable policies for contact-rich manipulation"
[CoRL 2024] ViPER: Visibility-based Pursuit-Evasion via Reinforcement Learning - Public code and model
[ICRA 2025] Real-Time LiDAR Point Cloud Compression and Transmission for Resource-constrained Robots
GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization
A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.
Official GitHub Repository for Paper "Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill", ICRA 2024
[IROS'25 Oral] WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation
[TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"
We proposed to explore and search for the target in unknown environment based on Large Language Model for multi-robot system.
[CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
Leveraging Large Language Models for Visual Target Navigation
[NeurIPS 2024] SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
Official code release for ConceptGraphs
The repository provides code associated with the paper VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation (ICRA 2024)
[CVPR24] Volumetric Environment Representation for Vision-Language Navigation
A most Frontend Collection and survey of vision-language model papers, and models GitHub repository. Continuous updates.
Famous Vision Language Models and Their Architectures
Train transformer language models with reinforcement learning.
Open-source and strong foundation image recognition models.
The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.
Denoising Diffusion Probabilistic Models