Skip to content
View mdys's full-sized avatar

Block or report mdys

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

一元机场官网

371 18 Updated Nov 23, 2025

Large Audio Language Model for Natural Voice Interactions - All-in-One Docker Image with 7 Processing Modes

Python 4 1 Updated Dec 28, 2025

Professional Antigravity Account Manager & Switcher. One-click seamless account switching for Antigravity Tools. Built with Tauri v2 + React (Rust).专业的 Antigravity 账号管理与切换工具。为 Antigravity 提供一键无缝账号切…

Rust 9,724 1,161 Updated Jan 7, 2026

Multilingual Voice Understanding Model

Python 7,300 677 Updated Dec 30, 2025

Fun-ASR-Nano-2512官方发布的仓库内容有点多,部署起来坑也比较多,本项目提供一个简化的部署方案。

Python 63 19 Updated Dec 26, 2025

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 113,915 12,055 Updated Jan 7, 2026

X-Talk is an open-source full-duplex cascaded spoken dialogue system framework enabling low-latency, interruptible, and human-like speech interaction with a lightweight, pure-Python, production-rea…

Python 142 15 Updated Jan 4, 2026

Utilizes ONNX Runtime for audio denoising.

Python 107 17 Updated Dec 27, 2025

基于 FunASR SenseVoice 模型的实时语音识别服务,支持说话人识别、音频降噪、ASR 错误修正等高级功能。

Python 8 8 Updated Oct 14, 2025

Port of Funasr's Sense-voice model in C/C++

C 507 63 Updated Dec 19, 2025

Utilizes ONNX Runtime to transcribe audio into text.

Python 73 13 Updated Jan 4, 2026

Pseudo Streaming SenseVoice with Hotwords

Python 414 48 Updated Mar 13, 2025

很多镜像都在国外。比如 gcr 。国内下载很慢,需要加速。致力于提供连接全世界的稳定可靠安全的容器镜像服务。

Shell 12,822 1,420 Updated Jan 4, 2026

一个基于 Sherpa-ONNX 的高性能语音识别服务,支持实时VAD(语音活动检测)、多语言语音识别和声纹识别功能。

Go 64 17 Updated Jan 4, 2026

stt websockect server using sherpa-onnx

C++ 41 8 Updated Jul 30, 2025
WebAssembly 7 1 Updated Jul 28, 2022

Fun-CosyVoice3-0.5B-2512 语音合成服务的简化部署方案,以及快速测试和部署提供应用调用

Python 40 9 Updated Dec 24, 2025

UFO³: Weaving the Digital Agent Galaxy

Python 7,924 971 Updated Jan 6, 2026

Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.

Python 616 62 Updated Dec 25, 2025

Android Automation Tool Based on Vision-Language Models

Kotlin 925 101 Updated Dec 18, 2025

Use ChatGPT On Wechat via wechaty

TypeScript 13,299 3,793 Updated May 20, 2024

这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.

Python 13 4 Updated Dec 17, 2025

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 21,068 3,376 Updated Jan 5, 2026

Burp Suite HTTP traffic monitoring & management extension for security testers

Shell 52 2 Updated Dec 28, 2025

A next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…

TypeScript 17,216 1,767 Updated Jan 7, 2026
Kotlin 262 82 Updated Jan 7, 2026

An open source reinforcement learning framework for training, evaluating, and deploying robust trading agents.

Python 5,817 1,175 Updated Jun 9, 2024

The privacy-first, self-hosted CAPTCHA for the modern web.

JavaScript 4,691 243 Updated Dec 30, 2025

GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning

Python 842 102 Updated Dec 17, 2025
Next