Skip to content
View LuoXubo's full-sized avatar

Highlights

  • Pro

Block or report LuoXubo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images, AAAI, 2024

Python 63 2 Updated Apr 9, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,240 947 Updated Aug 12, 2024

The version 2 of Mono 3D visiual grounding

Python 2 Updated Mar 21, 2025

Official code release for CoRL'25 paper: VT-Refine: Learning Bimanual Assembly with Visuo-Tactile Feedback via Simulation Fine-Tuning

Dockerfile 61 4 Updated Oct 18, 2025

Repository of the paper "AnyUp: Universal Feature Upsampling".

Jupyter Notebook 351 20 Updated Nov 9, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,521 42 Updated Oct 15, 2025

Reading list for research topics in embodied vision

677 79 Updated Jun 13, 2025

This is the code for the IROS2025 RoboSense challenge track1: LLM for Driving

Python 2 Updated Oct 29, 2025

Robot Learning Beyond Earth

Python 88 10 Updated Oct 29, 2025
Python 113 5 Updated Sep 8, 2025

[ACMMM 2025] Official implementation of SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero Shot 3D Visual Grounding

Python 10 Updated Oct 28, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,221 1,294 Updated Nov 10, 2025

SLAM-Former: Putting SLAM into One Transformer

381 5 Updated Sep 26, 2025

[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding

Python 182 4 Updated Apr 21, 2025

A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration

54 4 Updated Jun 20, 2025

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

1,923 81 Updated Nov 8, 2025

MLNLP社区用来帮助大家避免论文投稿小错误的整理仓库。 Paper Writing Tips

4,168 509 Updated May 29, 2022

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 2,216 124 Updated Oct 24, 2025

Building General-Purpose Robots Based on Embodied Foundation Model

Python 587 37 Updated Nov 7, 2025

[CVPR 2025] UniK3D: Universal Camera Monocular 3D Estimation

Python 631 52 Updated Sep 14, 2025

ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association

Python 153 4 Updated Nov 10, 2025

Code for FastVGGT: Training-Free Acceleration of Visual Geometry Transformer

Python 589 28 Updated Oct 14, 2025

InternRobotics' open platform for building generalized navigation foundation models.

Jupyter Notebook 384 38 Updated Nov 7, 2025

ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving

Python 336 26 Updated Nov 3, 2025

学术期刊配色推荐器

R 541 37 Updated Jan 27, 2025

[IROS25] Combining Flow Matching and Depth Priors for Efficient Navigation

Python 18 2 Updated Oct 27, 2025

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python 1,716 176 Updated Oct 4, 2025
Next