Skip to content
View Voc1's full-sized avatar

Block or report Voc1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Learning Chinese Character style with conditional GAN

Python 2,688 482 Updated Aug 9, 2019

[CVPR23] Visual Prompt Multi-Modal Tracking

Python 330 22 Updated Mar 4, 2025

🎮 An open-source game speed modifier.[一款开源的游戏变速器]

C++ 14,269 1,032 Updated Jan 2, 2026

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 65,223 6,576 Updated Nov 11, 2025

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,984 530 Updated Sep 25, 2024

[ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts

Python 261 13 Updated Oct 16, 2024

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 16,882 3,712 Updated Jun 2, 2023

Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)

Python 544 52 Updated Nov 23, 2024

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 12,204 1,153 Updated Dec 22, 2025

VideoX: a collection of video cross-modal models

Python 1,053 163 Updated Jun 3, 2024

Visual Object Tracking

Python 551 62 Updated Nov 8, 2025

(CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling

Python 211 6 Updated Jul 28, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 36,162 5,100 Updated Jan 9, 2026

[Accepted by ICCV2025] Official code of the paper "From Easy to Hard: Progressive Active Learning Framework for Infrared Small Target Detection with Single Point Supervision"

Python 213 12 Updated Dec 8, 2025
Python 246 20 Updated Apr 22, 2022

Code release for DynamicTanh (DyT)

Python 1,032 86 Updated Mar 30, 2025

CVPR 2025: Frequency Dynamic Convolution for Dense Image Prediction

Python 317 9 Updated Oct 27, 2025

Large World Model -- Modeling Text and Video with Millions Context

Python 7,393 560 Updated Oct 19, 2024

[CVPR 2024 & TPAMI 2025] UniRepLKNet

Python 1,060 60 Updated Aug 10, 2025

Writing AI Conference Papers: A Handbook for Beginners

3,289 120 Updated Jul 16, 2025
Python 43 3 Updated Dec 17, 2025

EVA Series: Visual Representation Fantasies from BAAI

Python 2,635 189 Updated Aug 1, 2024

This repository is a paper digest of Transformer-related approaches in visual tracking tasks.

353 32 Updated Dec 13, 2025

[CVPR 2025] Learning Occlusion-Robust Vision Transformers for Real-Time UAV Tracking

Python 76 4 Updated Jun 12, 2025

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Jupyter Notebook 2,463 158 Updated Dec 24, 2024

[CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"

Jupyter Notebook 849 66 Updated Dec 8, 2025

Official Code for "MITracker: Multi-View Integration for Visual Object Tracking"

Python 121 5 Updated Jun 18, 2025

[ICCV 2025 Oral] MVTracker: Multi-view 3D Point Tracking

Python 450 18 Updated Nov 3, 2025
Next