Stars
AI-Powered Watermark Remover using Florence-2 and LaMA: Remove watermarks from images and videos, including AI-generated content from Sora, Runway, and others. Features a modern PyWebview GUI.
基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.
a machine learning image inpainting task that instinctively removes watermarks from image indistinguishable from the ground truth image
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
This project is a crawler which trying to get all lemmas of Baidu Baike.The author has downloaded about 100,000 lemmas in one and a half hour.This project uses https://github.com/qq1367212627/XDX03…
100+ Chinese Word Vectors 上百种预训练中文词向量
🔥中文 prompt 精选🔥,ChatGPT 使用指南,提升 ChatGPT 可玩性和可用性!🚀
OpenAI CLIP text encoders for multiple languages!
【ICCV 2023】Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning & 【IJCV 2025】Diffusion-Enhanced Test-time Adaptation with Text and Image Augmentation
This is a list of awesome methods about data augmentation.
A data augmentations library for audio, image, text, and video.
The download methods of Vision-language Continual Pretraining Dataset P9D.
[ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation
A library for efficient similarity search and clustering of dense vectors.
NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"
A High Performance Metadata System for Kubernetes