Stars
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Out-of-the-box DeepSeek OCR document parsing Web Studio
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, try Sync Labs.
The fastest and most powerful Android decompiler (a native tool that works without a Java VM) for APK, DEX, ODEX, OAT, JAR, AAR, and CLASS files. It supports malicious behavior detection, privacy lea…
Added vLLM support to IndexTTS for faster inference.
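A powerful, easy-to-use MySQL database MCP (Model Context Protocol) server that lets your AI assistant safely perform full database operations. It supports multi-database connection management, create/read/update/delete operations, transaction management, and intelligent rollback.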
Translate the video from one language to another and add dubbing.
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
A lightweight LMM-based Document Parsing Model
Convert any text to a graph of knowledge. This can be used for Graph Augmented Generation or Knowledge Graph based QnA
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App: [MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
"RAG-Anything: All-in-One RAG Framework"
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
A web-based online PPT editor
Convert Office PowerPoint (.pptx) files to readable JSON data
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Build cross-platform desktop apps with JavaScript, HTML, and CSS