
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 13,807 1,514 Updated Oct 10, 2025

Added vLLM support to IndexTTS for faster inference.

Python 763 104 Updated Oct 21, 2025

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

TypeScript 66,361 7,016 Updated Oct 22, 2025

Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.

TypeScript 19,906 1,518 Updated Oct 21, 2025

Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs

Python 1,324 205 Updated Jul 1, 2025

Image tracking, Location Based AR, Marker tracking. All on the Web.

JavaScript 5,821 983 Updated Aug 30, 2025

Cut and paste your surroundings using AR

TypeScript 14,639 2,051 Updated Mar 4, 2023

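✨ One-stop LLM chatbot platform and development framework ✨ Supports QQ, QQ Channel, Telegram, WeCom, Feishu, DingTalk | Knowledge base, MCP server, OpenAI, DeepSeek, Gemini, SiliconFlow, Moonshot AI, Ollama, OneAPI, Dify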

Python 12,759 932 Updated Oct 21, 2025

-

TypeScript 1,736 212 Updated Jul 18, 2025

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 23,267 4,278 Updated Oct 9, 2025

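This project is a streaming media control and management API platform based on ZLMediaKit. It supports pull-stream and push-stream control for RTSP and GB28181 devices, and PTZ control over GB28181.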

C# 160 64 Updated Feb 26, 2021

WebRTC/RTSP/RTMP/HTTP/HLS/HTTP-FLV/WebSocket-FLV/HTTP-TS/HTTP-fMP4/WebSocket-TS/WebSocket-fMP4/GB28181/SRT server and client framework based on C++11

C++ 15,946 3,831 Updated Oct 19, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 16,931 1,845 Updated Oct 21, 2025

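AIGCPanel is a simple, easy-to-use, one-stop AI digital human system. It supports video synthesis, voice synthesis, and voice cloning, and simplifies local model management with one-click import and use of AI models.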

TypeScript 4,082 567 Updated Oct 14, 2025

VirtualWife is a virtual digital human project that supports Bilibili live streaming and works with OpenAI and Ollama.

Python 2,700 425 Updated Oct 27, 2024

SOTA Open Source TTS

Python 23,341 1,930 Updated Oct 20, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 13,142 1,335 Updated Oct 1, 2025

🚀 Truly open-source AI avatar (digital human) toolkit for offline video generation and digital human cloning.

C 11,479 1,878 Updated Oct 16, 2025

run DeepSeek-R1 GGUFs on KTransformers

Python 253 16 Updated Mar 3, 2025

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 86,182 60,912 Updated Oct 22, 2025

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 179,164 46,054 Updated Oct 21, 2025

Production-ready platform for agentic workflow development.

TypeScript 116,981 18,061 Updated Oct 22, 2025

🤯 Lobe Chat - an open-source, modern design AI chat framework. Supports multiple AI providers (OpenAI / Claude 4 / Gemini / DeepSeek / Ollama / Qwen), Knowledge Base (file upload / RAG ), one click…

TypeScript 67,055 13,859 Updated Oct 22, 2025

🤱🏻 Turn any webpage into a desktop app with one command. Package a webpage into a lightweight desktop app in one click.

Rust 42,953 8,164 Updated Oct 22, 2025

MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting

Python 4,854 656 Updated Sep 26, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 28,766 3,009 Updated Oct 22, 2025

A generative speech model for daily dialogue.

Python 38,010 4,125 Updated Jul 6, 2025

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Python 15,482 1,154 Updated Oct 20, 2025

Heterogeneous AI Computing Virtualization Middleware (Project under CNCF)

Go 2,424 401 Updated Oct 21, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 15,258 1,180 Updated Oct 15, 2025