Stars
[CVPR 2025] Multiple Object Tracking as ID Prediction
A DeNoising FPN with Transformer R-CNN for Tiny Object Detection
Official code for the AAAI 2026 paper ”Spatio-Temporal Context Learning with Temporal Difference Convolution for Moving Infrared Small Target Detection“
Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection, NeurIPS2020
Official implementation of ICCV2023 VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation
[TGRS 24] Triple-domain Feature Learning with Frequency-aware Memory Enhancement for Moving Infrared Small Target Detection
[ICRA'23] Dataset of Moving Object Detection; Official Implementation of "RGB-Event Fusion for Moving Object Detection in Autonomous Driving"
Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research
The dataset for drone based detection and tracking is released, including both image/video, and annotations.
Official repository for VIGOR : Cross-View Image Geo-localization beyond One-to-one Retrieval
MFRGN: Multi-scale Feature Representation Generalization Network for Ground-to-Aerial Geo-localization
Pytorch implementation of Learning Cross-view Geo-localization Embeddings via Dynamic Weighted Decorrelation Regularization https://arxiv.org/abs/2211.05296
[AAAI 2025 Oral🚁] Game4Loc: A UAV Geo-Localization Benchmark from Game Data
ACM Multimedia2020 University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization 🚁 annotates 1652 buildings in 72 universities around the world.
Code for paper "Attention on Attention for Image Captioning". ICCV 2019
Code Repository for Liquid Time-Constant Networks (LTCs)
🔥 Large Dataset for Remote Sensing Image Change Captioning and Detection
[NeurIPS 2024 spotlight] Offical implementation of MSFA and release of SARDet_100K dataset for Large-Scale Synthetic Aperture Radar (SAR) Object Detection