Stars
[AAAI 2025] SSLFusion: Scale and Space Aligned Latent Fusion Model for Multimodal 3D Object Detection
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
CLIP (Contrastive Language-Image Pretraining): predicts the most relevant text snippet given an image
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
OpenMMLab's next-generation platform for general 3D object detection.
Code release for our NeurIPS 2023 paper "Uni3DETR: Unified 3D Detection Transformer", our ECCV 2024 paper "OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propag…