Starred repositories
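Bili23 Downloader is a cross-platform (Windows/Linux/macOS) Bilibili video downloader that supports user-uploaded videos, anime series, movies, and other video types. It offers multi-threaded acceleration and resumable downloads, and pairs a graphical interface with zero-configuration operation for an efficient, convenient download experience.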
A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
C# binding for portaudio supporting Linux, macOS, Windows, iOS
SoulX-FlashTalk is the first 14B model to achieve sub-second start-up latency (0.87s) while maintaining a real-time throughput of 32 FPS on an 8xH800 node.
An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.
Sapi5 interface for espeak-ng text-to-speech synthesizer
Hunt down social media accounts by username across social networks
🇨🇳 Cmirror: a one-click mirror-switching tool built for developers in mainland China (A unified CLI for managing mirrors: Pip, NPM, Docker, Cargo, Apt, Go, Brew).
Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
All Algorithms implemented in Python
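《动手学深度学习》 (Dive into Deep Learning): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.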
A Windows virtual display driver to add multiple virtual monitors to your PC! For Windows 10 and later. Works with VR, OBS, streaming software, etc.
Easily and securely send things from one computer to another 🐊 📦
⏺️ A simple recording program with the ability to record screens and audio on your computer.
Universal Pasteboard Across Devices
My learning notes for ML SYS.
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
A framework for efficient model inference with omni-modality models
Simple, unified interface to multiple Generative AI providers
A vibed accessible, modern RSS client for the blind
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.