Stars
[ECCV 2022] & [IJCV 2024] Official implementation of the paper: Audio-Visual Segmentation (with Semantics)
[CVPR 2025] 🔥 Official impl. of "Audio-Visual Instance Segmentation".
✨ Finder Toolbar app for macOS to open the current directory in Terminal, iTerm, Hyper or Alacritty.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
ALLWEONE® Open source AI presentation generator Gamma Alternative. Create professional slides with customizable themes and AI-generated content in minutes.
[ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"
Recommend new arxiv papers of your interest daily according to your Zotero libarary.
High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI …
A concise, beginner-friendly introduction to the core ideas of linear algebra.
A collection of sample agents built with Agent Development (ADK)
将gpt_academic的arxiv论文翻译单独抽取出来,更方便部署和集成arxiv论文翻译
Flux Kontext Inpainting ComfyUI Implementation
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
AI Powered Knowledge Graph Generator
Legacy-Mess Detector – assess the “legacy-mess level” of your code and output a beautiful report | 屎山代码检测器,评估代码的“屎山等级”并输出美观的报告
Collection of leaked system prompts
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
View and Interact with PDFs in React, SolidJS, Svelte and JavaScript apps
一个基于 Markmap 的在线AI思维导图工具,可将文本或Markdown一键转换为精美的思维导图,支持自定义API和导出PNG&SVG。 A web-based AI mind map tool using Markmap. Instantly converts text or Markdown into beautiful mind maps, supporting custom AP…
An open-source translation agent designed to enhance the quality of text translations by leveraging large language models
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex layout handling, complicated table parsing and cross-page conte…