Stars
Learning to Tell Apart: Weakly Supervised Video Anomaly Detection via Disentangled Semantic Alignment (AAAI 2026)
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.
[TPAMI 2025 & CVPR 2023] IGEV++: Iterative Multi-range Geometry Encoding Volumes for Stereo Matching
[ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model
Official implementation of "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM"
[CVPR 2024] Generating Human Motion in 3D Scenes from Text Descriptions
Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability
Code for our CVPR 2023 paper "MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition".
Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".
Code for our paper "HyRSM++: Hybrid Relation Guided Temporal Set Matching for Few-shot Action Recognition".
Awesome Online Action Detection
Code for our CVPR 2022 Paper "Hybrid Relation Guided Set Matching for Few-shot Action Recognition".
Code for our ICCV 2021 Paper "OadTR: Online Action Detection with Transformers".
Code for our CVPR 2021 Paper "Self-Supervised Learning for Semi-Supervised Temporal Action Proposal".
A faster pytorch implementation of faster r-cnn