Harbin Institute of Technology
- Spain
Stars
Official implementation for “Towards Single-Source Domain Generalized Object Detection via Causal Visual Prompts” (NeurIPS 2025)
[NeurIPS '25 Spotlight] Official PyTorch implementation of "Vision Transformers Don't Need Trained Registers"
CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning (NeurIPS 2024)
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
[CVPR 2025 Highlight] SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning
Implementation of ReRAW: RGB-to-RAW Image Reconstruction via Stratified Sampling for Efficient Object Detection on the Edge
Source code for "Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object Detection"
Dynamic 3D Foundation Model using Causal Transformer
[ICCV 2025] Hybrid-TTA: Continual Test-time Adaptation via Dynamic Domain Shift Detection
[ICCV 2025] UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions
Reference PyTorch implementation and models for DINOv3
[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).
Official code for the paper: Depth Anything At Any Condition
[ICCV 2025] Official implementation of the paper "Beyond RGB: Adaptive Parallel Processing for RAW Object Detection"
[ICCV 2025] Official implementation of the paper: "Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection"
[ICCV 2025] Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction
[ICCV2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation"
Make Large Multimodal Models excel in object detection, ICCV 2025
[ICCV25 Oral] Diving into the Fusion of Monocular Priors for Generalized Stereo Matching
Controllable-LPMoE: Adapting to Challenging Object Segmentation via Dynamic Local Priors from Mixture-of-Experts (ICCV, 2025)
[ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation
[Accepted by ICCV2025] Official code of the paper "From Easy to Hard: Progressive Active Learning Framework for Infrared Small Target Detection with Single Point Supervision"
[ICCV2025] ModPrompt: Visual Modality Prompt for Adapting Vision-Language Object Detectors
[ICCV2025 Highlight] Boosting Domain Generalized and Adaptive Detection with Diffusion Models: Fitness, Generalization, and Transferability
[ICCV 2025] Pretrained Reversible Generation as Unsupervised Visual Representation Learning
Code for ICCV2025 "FreeDNA: Endowing Domain Adaptation of Diffusion-Based Dense Prediction with Training-Free Domain Noise Alignment"
[CVPR 2025 Highlight] Official code for paper "Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation"
[ICCV 2025] Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation