-
Fun-ASR-Nano-2512-Deploy Public
Forked from fengin/Fun-ASR-Nano-2512-DeployFun-ASR-Nano-2512官方发布的仓库内容有点多,部署起来坑也比较多,本项目提供一个简化的部署方案。
Python UpdatedDec 26, 2025 -
-
-
GLM-ASR Public
Forked from zai-org/GLM-ASRGLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters
Python Apache License 2.0 UpdatedDec 10, 2025 -
livekit Public
Forked from livekit/livekitEnd-to-end realtime stack for connecting humans and AI
Go Apache License 2.0 UpdatedDec 9, 2025 -
tiny-audio Public
Forked from alexkroman/tiny-audioTrain your own speech AI model from scratch
Python UpdatedDec 9, 2025 -
minimind Public
Forked from jingyaogong/minimind🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
Python Apache License 2.0 UpdatedNov 27, 2025 -
DiariZen Public
Forked from BUTSpeechFIT/DiariZenA toolkit for speaker diarization.
Jupyter Notebook MIT License UpdatedNov 19, 2025 -
flashlight Public
Forked from flashlight/flashlightA C++ standalone library for machine learning
C++ MIT License UpdatedNov 12, 2025 -
annotated_deep_learning_paper_implementations Public
Forked from labmlai/annotated_deep_learning_paper_implementations🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Python MIT License UpdatedNov 11, 2025 -
langchain Public
Forked from langchain-ai/langchain🦜🔗 The platform for reliable agents.
Python MIT License UpdatedNov 10, 2025 -
flashinfer Public
Forked from flashinfer-ai/flashinferFlashInfer: Kernel Library for LLM Serving
Cuda Apache License 2.0 UpdatedOct 29, 2025 -
audiomentations Public
Forked from iver56/audiomentationsA Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
Python MIT License UpdatedSep 26, 2025 -
audiolab Public
Forked from pengzhendong/audiolabAn audio reader & writer built on top of PyAV
Python Apache License 2.0 UpdatedSep 26, 2025 -
CarelessWhisper-Streaming Public
Forked from tomer9080/CarelessWhisper-StreamingCausal streaming adaptation of OpenAI Whisper for real-time transcription on small audio chunks.
-
livekit-plugins-fireredchat-pvad Public
Forked from fireredchat-submodules/livekit-plugins-fireredchat-pvadFireRedChat pVAD plugin for LiveKit Agents
Python Apache License 2.0 UpdatedSep 16, 2025 -
VoiceBench Public
Forked from MatthewCYM/VoiceBenchVoiceBench: Benchmarking LLM-Based Voice Assistants
Python Apache License 2.0 UpdatedAug 22, 2025 -
TouchNet Public
Forked from xingchensong/TouchNetA native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.
Python Apache License 2.0 UpdatedAug 6, 2025 -
ContextASR-Bench Public
Forked from MrSupW/ContextASR-BenchA Massive Contextual Speech Recognition Benchmark.
Python MIT License UpdatedJul 9, 2025 -
GigaSpeech2 Public
Forked from SpeechColab/GigaSpeech2An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement
Python Apache License 2.0 UpdatedJun 28, 2025 -
cuml Public
Forked from rapidsai/cumlcuML - RAPIDS Machine Learning Library
C++ Apache License 2.0 UpdatedJun 17, 2025 -
dasheng-denoiser Public
Forked from xiaomi-research/dasheng-denoiserOfficial PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative Audioencoders
Python Apache License 2.0 UpdatedJun 16, 2025 -
AISHELL-5 Public
Forked from DaiYvhang/AISHELL-5In-car multi-channel speech transcription system of AISHELL-5.
Python Apache License 2.0 UpdatedJun 9, 2025 -
Kimi-Audio Public
Forked from MoonshotAI/Kimi-AudioKimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
Python UpdatedJun 3, 2025 -
-
AudioLMs-Descriptive-Speech-Quality-Evaluators Public
Forked from huckiyang/AudioLMs-Descriptive-Speech-Quality-EvaluatorsICLR 2025
Python UpdatedJun 2, 2025 -
-
markitdown Public
Forked from microsoft/markitdownPython tool for converting files and office documents to Markdown.
Python MIT License UpdatedMay 23, 2025 -
Dolphin Public
Forked from DataoceanAI/DolphinDolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.
Python Apache License 2.0 UpdatedMay 19, 2025 -
torchview Public
Forked from mert-kurttutan/torchviewtorchview: visualize pytorch models
Python MIT License UpdatedMay 18, 2025