-
abab.ai
- Beijing
- https://www.abab.ai/
- https://huggingface.co/PKU
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
The power of Claude Code / GeminiCLI / CodexCLI + [Gemini / OpenAI / OpenRouter / Azure / Grok / Ollama / Custom Model / All Of The Above] working as one.
🎥 Python and OpenCV-based scene cut/transition detection program & library.
A minimal yet professional single agent demo project that showcases the core execution pipeline and production-grade features of agents.
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
Web App that indexes videos with AI (object detection, face recognition, emotion analysis), enables semantic search through natural language queries, and export scenes
A Python library for efficient image generation using CSS Flexbox
Performance-focused Python video editing library. Alternative to MoviePy, powered by Numba.
Frame-accurate video cutting with only small quality loss
The swiss army knife of lossless video/audio editing
AI-short-creator is an AI-powered tool that turns long videos into short clips. It works best for videos with multiple speakers and topics, such as interviews and documentaries. AI-short-creator fi…
KLing-Video-WatermarkRemover-Enhancer is an open-source tool developed to enhance and clean up videos generated by KLing. This project is designed to automatically remove watermarks from videos and…
Create beautiful, animated video subtitles with Python and CSS.
自动化上传视频到社交媒体:抖音、小红书、视频号、tiktok、youtube、bilibili
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
LLMCompiler is an Agent Architecture designed to speed up the execution of agent tasks by executing them quickly in the DAG. It also saves the cost of redundant token use by reducing the number of …
Text-audio foundation model from Boson AI
A simple rotating animation text component
ImageBind One Embedding Space to Bind Them All
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
The absolute trainer to light up AI agents.
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
Fogsight is an AI agent and animation engine powered by Large Language Models.
AG-UI: the Agent-User Interaction Protocol. Bring Agents into Frontend Applications.
Byterover Cipher is an opensource memory layer specifically designed for coding agents. Compatible with Cursor, Codex, Claude Code, Windsurf, Cline, Claude Desktop, Gemini CLI, AWS's Kiro, VS Code,…
A simple screen parsing tool towards pure vision based GUI agent