Stars
A curated list of awesome HD map construction methods
Official implementation for "JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation"
InternRobotics' open platform for building generalized navigation foundation models.
[NeurIPS 2025] CogVLA: Cognition-Aligned Vision-Language-Action Models via Instruction-Driven Routing & Sparsification
A curated list of large VLM-based VLA models for robotic manipulation.
missTL / SeqGrowGraph
Forked from MIV-XJTU/SeqGrowGraph
SeqGrowGraph: Learning Lane Topology as a Chain of Graph Expansions
[RSS'25] This repository is the implementation of "NaVILA: Legged Robot Vision-Language-Action Model for Navigation"
The code for the paper "Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors"
Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"
[RSS 2024 & RSS 2025] VLN-CE evaluation code of NaVid and Uni-NaVid
[ICCV 2025] Official implementation for "SeqGrowGraph: Learning Lane Topology as a Chain of Graph Expansions"
[RSS 2025] Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks.
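📚 This repository collects arXiv papers on VLN, VLA, World Model, SLAM, Gaussian Splatting, nonlinear optimization, and related topics. It is updated automatically every day! The issues section lists the 10 most recent papers.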
Zhaoyibinn / vggt
Forked from facebookresearch/vggt
[CVPR 2025 Best Paper Award Candidate] VGGT: Visual Geometry Grounded Transformer
An app for collecting raw RGB-D scans on iOS devices.
Application for camera and sensor data logging (iOS)
missTL / FSDrive
Forked from MIV-XJTU/FSDrive
The repository has been moved to https://github.com/MIV-XJTU/FSDrive
[NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving"
We’re looking forward to models based on DINOv3. Rankings include: BetterDepth, BRIDGE, BriGeS, ChronoDepth, Depth Any Video, Depth Anything, Depth Pro, DepthCrafter, Distill Any Depth, FE2E, GRIN, M2SVid, MAS…
Python tools for rendering, viewing and generating metric 3D depth videos. Tools for recovering and exporting camera pose and 3D geometry to popular formats as well as tools for projecting depthvid…
Official implementation for "Driving with Prior Maps: Unified Vector Prior Encoding for Autonomous Vehicle Mapping"
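This project combines d3 and echarts for visualization, providing a visual data analysis of two NBA games through a range of interactive charts covering player movement trajectories, individual statistics, pass counts, and scoring locations. These visualizations allow a deeper analysis of each team's situation and help in devising better tactics.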