Stars
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko,…
💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩
High-Resolution Image Synthesis with Latent Diffusion Models
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Robust Speech Recognition via Large-Scale Weak Supervision
CoTracker is a model for tracking any point (pixel) on a video.
Real-time CPU person segmentation for privacy in video calls
Resources for Multiple Object Tracking (MOT)
⛹️ Pytorch ReID: A tiny, friendly, strong pytorch implement of person re-id / vehicle re-id baseline. Tutorial 👉https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial
[CVPR2022] DanceTrack: Multiple Object Tracking in Uniform Appearance and Diverse Motion
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
Reference models and tools for Cloud TPUs.
On the Unreasonable Effectiveness of Centroids in Image Retrieval
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
State-of-the-art 2D and 3D Face Analysis Project
A treasure chest for visual classification and recognition powered by PaddlePaddle
[CVPR'22 Oral] GMFlow: Learning Optical Flow via Global Matching
Top-level directory for documentation and general content
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
Sparsity-aware deep learning inference runtime for CPUs
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
Visualizer for neural network, deep learning and machine learning models
🎓 Sharing machine learning course / lecture notes.
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors