Starred repositories
Now we have become very big, Different from the original idea. Collect premium software in various categories.
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
Python tool for converting files and office documents to Markdown.
Fogsight is an AI agent and animation engine powered by Large Language Models.
Build Real-Time Knowledge Graphs for AI Agents
🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …
[NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from sim…
⚡ Dynamically generated, customizable SVG that gives the appearance of typing and deleting text for use on your profile page, repositories, or website.
Pocket Flow: Codebase to Tutorial
坚持分享 GitHub 上高质量、有趣实用的开源技术教程、开发者工具、编程网站、技术资讯。A list cool, interesting projects of GitHub.
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!
Instant voice cloning by MIT and MyShell. Audio foundation model.
Video translation and dubbing tool powered by LLMs. The video translator offers 100 language translations and one-click full-process deployment. The video translation output is optimized for platfo…
The beautiful & flexible React.js docs framework.
Stock options, RSUs, taxes — read the latest edition: www.holloway.com/ec
Production-ready platform for agentic workflow development.
The ultimate LLM/AI application development framework in Golang.
real time face swap and one-click video deepfake with only a single image
🚀 Truly open-source AI avatar(digital human) toolkit for offline video generation and digital human cloning.
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
pq-dong / Chat2SVG
Forked from kingnobro/Chat2SVG(CVPR 2025) Code of "Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models"
Elegant reading of real-time and hottest news
StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and te…
本项目为xiaozhi-esp32提供后端服务,帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.