Skip to content
View centosrhel's full-sized avatar

Block or report centosrhel

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 9 Updated Oct 14, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,190 1,292 Updated Nov 10, 2025

CLIP-like model evaluation

Python 786 98 Updated Nov 7, 2025

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 4,208 361 Updated Oct 19, 2025

Robust fine-tuning of zero-shot models

Python 748 74 Updated Apr 29, 2022

MiniRBT (中文小型预训练模型系列)

Python 296 18 Updated Jul 15, 2025

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

Python 10,119 1,396 Updated Jul 15, 2025

MTEB: Massive Text Embedding Benchmark

Python 2,960 501 Updated Nov 10, 2025

RayGen: Multi-Modal Dataset Reinforcement for MobileCLIP and MobileCLIP2

Python 32 2 Updated Aug 29, 2025

This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023

Python 1,967 119 Updated Nov 30, 2023

An open source implementation of CLIP.

Python 12,916 1,195 Updated Nov 4, 2025

Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks

OpenEdge ABL 207 22 Updated Feb 12, 2025

Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)

Jupyter Notebook 179 13 Updated Jun 21, 2025

New generation of CLIP with fine grained discrimination capability, ICML2025

Python 446 24 Updated Oct 27, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,728 5,062 Updated Nov 6, 2025
Python 1,567 89 Updated Sep 30, 2025

Example pybind11 module built with a CMake-based build system

Python 674 223 Updated Nov 10, 2025

Retrieval and Retrieval-augmented LLMs

Python 10,811 804 Updated Oct 22, 2025

[ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation

Python 207 15 Updated Mar 31, 2025

[ECCV 2024 Oral] PetFace: A Large-Scale Dataset and Benchmark for Animal Identification https://arxiv.org/abs/2407.13555

Python 78 6 Updated Jul 27, 2025

Parsing-free RAG supported by VLMs

Python 852 68 Updated Oct 22, 2025

😎 Finding duplicate images made easy!

Python 5,529 473 Updated Aug 15, 2025

Refine high-quality datasets and visual AI models

Python 10,016 680 Updated Nov 10, 2025

Official implementation for the paper: "Multi-label Classification with Partial Annotations using Class-aware Selective Loss"

Python 132 20 Updated Aug 23, 2022

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 55,977 17,320 Updated Nov 9, 2025

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 6,930 773 Updated Nov 10, 2025

Precision Search through Multi-Style Inputs

Python 73 7 Updated Jul 30, 2025

Official implementation of paper "Query2Label: A Simple Transformer Way to Multi-Label Classification".

Python 452 69 Updated Mar 18, 2022
Jupyter Notebook 689 54 Updated Nov 5, 2025

[CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"

Python 402 31 Updated Nov 10, 2023
Next