Stars
Spec-driven development (SDD) for AI coding assistants.
💫 Toolkit to help you get started with Spec-Driven Development
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Build Real-Time Knowledge Graphs for AI Agents
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.
基于FastAPI的语音服务系统,集成语音合成(TTS)和语音识别(STT)功能。使用CosyVoice2作为TTS引擎,FunASR作为STT引擎,支持零样本语音克隆、流式输出、多种语言识别等高级功能。
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Demo app for Groq plugins in LiveKit Agents
A powerful framework for building realtime voice AI agents 🤖🎙️📹
This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenceVoice, the LLM models are QWen2.5-0.5B/1.5B, and there are three …
The python library for real-time communication
An AI digital human real-time streaming video voice call project, including picture and voice input and picture voice output
OpenHealth, AI Health Assistant | Powered by Your Data
Generate ARKit expression from audio in realtime
Real time interactive streaming digital human
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction…
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Sealos is an AI-native Cloud Operating System built on Kubernetes that unifies the entire application lifecycle, from development in cloud IDEs to production deployment and management. It is perfec…
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching