Stars
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Added vLLM support to IndexTTS for faster inference.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.
Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs
Image tracking, Location Based AR, Marker tracking. All on the Web.
Cut and paste your surroundings using AR
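✨ One-stop LLM chatbot platform and development framework ✨ Supports QQ, QQ Channels, Telegram, WeCom, Feishu, DingTalk | Knowledge base, MCP servers, OpenAI, DeepSeek, Gemini, SiliconFlow, Moonshot AI, Ollama, OneAPI, Dify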
TradingAgents: Multi-Agents LLM Financial Trading Framework
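A streaming media control and management API platform based on ZLMediaKit. It supports pull and push stream control for RTSP and GB28181 devices, with PTZ control for GB28181.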
WebRTC/RTSP/RTMP/HTTP/HLS/HTTP-FLV/WebSocket-FLV/HTTP-TS/HTTP-fMP4/WebSocket-TS/WebSocket-fMP4/GB28181/SRT server and client framework based on C++11
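Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
AIGCPanel is a simple, easy-to-use, all-in-one AI digital human system. It supports video synthesis, voice synthesis, and voice cloning, and simplifies local model management with one-click import and use of AI models.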
VirtualWife is a virtual digital human project that supports Bilibili live streaming and supports OpenAI and Ollama.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
🚀 Truly open-source AI avatar (digital human) toolkit for offline video generation and digital human cloning.
Run DeepSeek-R1 GGUFs on KTransformers
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Production-ready platform for agentic workflow development.
🤯 Lobe Chat - an open-source, modern design AI chat framework. Supports multiple AI providers (OpenAI / Claude 4 / Gemini / DeepSeek / Ollama / Qwen), Knowledge Base (file upload / RAG ), one click…
🤱🏻 Turn any webpage into a desktop app with one command. Package a webpage into a lightweight desktop app in one step.
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
A modular graph-based Retrieval-Augmented Generation (RAG) system
A generative speech model for daily dialogue.
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Heterogeneous AI Computing Virtualization Middleware (Project under CNCF)
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.