Lists (32)
Sort Name ascending (A-Z)
backbone
fast_det
移动端检测框架image caption
图生文iptv
micro
reid
rk
人体关键点
人脸
光流
分割-sam
双光融合
图传
图像抠图
大模型
小目标检测
情绪识别
惯导相关
手部姿态
无人机拍摄数据集
无人机数据集
无监督
暗光增强
条形码和二维码数据集
深度估计
爬虫
自动裁剪
蒸馏
视频识别
语音
跟踪
高通
Stars
[CVPR 2024] Official implementation of the paper "Visual In-context Learning"
[ICCV2023] MixSort: The Customized Tracker in SportsMOT
Official Pytorch implementation of "Pose2Mesh: Graph Convolutional Network for 3D Human Pose and Mesh Recovery from a 2D Human Pose", ECCV 2020
PyTorch transcribed audioset classifier, including VGGish and YAMNet, along with utils to manipulate autioset category ontology.
VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs
Muon is an optimizer for hidden layers in neural networks
The Amazon S3 Connector for PyTorch delivers high throughput for PyTorch training jobs that access and store data in Amazon S3.
The official implementation of OmniTrack: Omnidirectional Multi-Object Tracking (CVPR 2025)
Collection of common code that's shared among different research projects in FAIR computer vision team.
This is the official repo for the paper "LongCat-Flash-Omni Technical Report"
[ECCV 2024] Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance
MediaTek's TFLite delegate
Pytorch implementation of the 'Slim-neck by GSConv: a lightweight-design for real-time detector architectures'
ADNet: Leveraging Error-Bias Towards Normal Direction in Face Alignment [Official, ICCV 2021]
Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)
A diverse benchmark database for multi-paradigm facial beauty prediction
Faster Whisper transcription with CTranslate2
A player reidentification challenge on Basketball images. An opportunity to publish at MMSports @ ACMMM and to win 2x $500.
Running Real Time face motioncapture using Mediapipe and AR-Kit Shapekeys
Winners of the 2022 DeepSportRadar player re-identification challenge
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
Official PyTorch implementation for the paper Generalizable Face Landmarking Guided by Conditional Face Warping (CVPR 2024).