Stars
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
State-of-the-Art Text Embeddings
Multilingual Document Layout Parsing in a Single Vision-Language Model
ncnn android yolo11 realtime detection, segmentation, pose estimation, classification and obb
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
[CVPR2023] MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors
Implementation of "TrackFormer: Multi-Object Tracking with Transformers". [Conference on Computer Vision and Pattern Recognition (CVPR), 2022]
The training program for libfacedetection for face detection and 5-landmark detection.
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
PyTorch implementation of the TrackNet Series for real-time tracking of small, fast-moving objects in sports videos. Pre-trained models available
Implementation of paper - TrackNetV3: Enhancing ShuttleCock Tracking with Augmentations and Trajectory Rectification
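An Android project built on NCNN forward inference that performs object detection on still images and live camera streams (video). It ships NCNN model files for YoloV5, YoloV7, YoloV8, YoloV10, YoloV11, rtmdet, and others, and bundles the opencv and ncnn dependency packages. After downloading, set up the dependencies in Android Studio and it runs directly! If you find it helpful, please click star ⭐!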
A lightweight OCR system rebuilt from PaddleOCR and decoupled from the PaddlePaddle deep learning training framework, with ultra-fast inference speed.
SLAM-Former: Putting SLAM into One Transformer
[3DV 2026 ORAL] Official Repo of "SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization"
[CVPR 2025] DEFOM-Stereo: Depth foundation model based stereo matching
KeyDecoder app lets you use your smartphone or tablet to decode your mechanical keys in seconds.
Track vehicles and persons on rk3588 / rk3399pro.
C++ image processing and machine learning library that uses SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.