ronghanghu

Organizations

@BVLC @DarrellGroup


🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,937 316 Updated Nov 29, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,328 442 Updated Nov 28, 2025

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 4,697 433 Updated Nov 29, 2025

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

Jupyter Notebook 897 56 Updated Nov 25, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 16,888 2,684 Updated Nov 29, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,773 117 Updated Sep 16, 2025

Microsoft PowerToys is a collection of utilities that help you customize Windows and streamline everyday tasks

C# 126,127 7,511 Updated Nov 29, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 17,851 2,248 Updated Dec 25, 2024

The benchmark for "Video Object Segmentation in Panoptic Wild Scenes".

Python 12 1 Updated Oct 17, 2023

[ECCV2022] MOTR: End-to-End Multiple-Object Tracking with TRansformer

Python 751 106 Updated Jan 15, 2024

[CVPR2023] MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors

Python 452 60 Updated Feb 28, 2023

A PyTorch implementation of Connected Components Labeling

Jupyter Notebook 121 28 Updated Jun 8, 2023

[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,164 74 Updated Oct 21, 2024

Modern WebSocket support for Flask.

Python 312 25 Updated Jan 6, 2025

Monocular Depth Estimation Toolbox based on MMSegmentation.

Python 963 111 Updated Jul 21, 2025

Pipeline Parallelism for PyTorch

Python 784 88 Updated Aug 21, 2024

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Jupyter Notebook 1,585 97 Updated Feb 16, 2024

Model parallel transformers in JAX and Haiku

Python 6,355 890 Updated Jan 21, 2023

JAX-based neural network library

Python 3,129 269 Updated Sep 29, 2025

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,247 204 Updated May 19, 2025

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…

Python 540 70 Updated Nov 17, 2025

2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.

Python 465 108 Updated Jul 4, 2022

Making large AI models cheaper, faster and more accessible

Python 41,271 4,541 Updated Nov 24, 2025

JAX - A curated list of resources https://github.com/google/jax

1,981 154 Updated Sep 2, 2025

Abseil Common Libraries (Python)

Python 2,410 268 Updated Nov 25, 2025

Abseil Common Libraries (C++)

C++ 16,619 2,908 Updated Nov 29, 2025

ConvMAE: Masked Convolution Meets Masked Autoencoders

Python 519 42 Updated Mar 14, 2023

A paper list of some recent Transformer-based CV works.

1,384 146 Updated Nov 19, 2025