A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech

Python 697 45 Updated Nov 25, 2025

e1tts / e1tts.github.io

Python 8 Updated Sep 16, 2024

NVlabs / OmniVinci

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 581 49 Updated Oct 29, 2025

NVIDIA / BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,151 142 Updated Sep 5, 2024

meituan-longcat / LongCat-Flash-Omni

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 422 23 Updated Nov 25, 2025

jgraph / drawio-desktop

Official electron build of draw.io

JavaScript 58,074 5,508 Updated Nov 17, 2025

zzhdbw / Spark-TTS

Forked from SparkAudio/Spark-TTS

Spark-TTS Inference Code

Python 7 Updated Aug 19, 2025

SparkAudio / Spark-TTS

Spark-TTS Inference Code

Python 10,742 1,146 Updated Apr 9, 2025

lifeiteng / vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,188 332 Updated Sep 10, 2025

lifeiteng / OmniSenseVoice

Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯

Python 876 42 Updated Oct 28, 2025

shibing624 / pycorrector

pycorrector is a toolkit for text error correction. 文本纠错，实现了Kenlm，T5，MacBERT，ChatGLM3，Qwen2.5等模型应用在纠错场景，开箱即用。

Python 6,270 1,155 Updated Nov 20, 2025

TW-NLP / ChineseErrorCorrector

一个面向中文文本纠错任务的综合平台，集学术研究、模型训练、模型评测和推理部署于一体，覆盖拼写纠错与语法纠错两个核心方向。

Python 437 35 Updated Nov 26, 2025

taishan1994 / awesome-chinese-text-correction

中文文本纠错相关的论文、比赛和工具。

68 5 Updated Sep 16, 2025

swiftlang / swift

The Swift Programming Language

C++ 69,365 10,605 Updated Nov 26, 2025

FunAudioLLM / OmniAudio

Python 5 2 Updated May 21, 2025

CyberAgentAILab / mbr-for-asr

Code for Re-evaluating Minimum Bayes Risk Decoding for Automatic Speech Recognition

Python 5 4 Updated Oct 28, 2025

St3p99 / speechllm

Python 1 Updated Jul 14, 2025

zhu-han / SpeechLLM

LLM-based ASR recipe with Zipformer encoder and Qwen LLM

Python 18 3 Updated Sep 25, 2025

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 12,387 1,948 Updated Oct 20, 2025

fxsjy / jieba

结巴中文分词

Python 34,595 6,736 Updated Aug 21, 2024

baidu / lac

百度NLP：分词，词性标注，命名实体识别，词重要性

C++ 3,972 594 Updated May 25, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shylock shylockasr

Block or report shylockasr

Stars

Tencent-Hunyuan / HunyuanVideo-1.5

cmusphinx / pocketsphinx

vllm-project / vllm

robmsmt / ASR-Audio-Data-Links

langchain-ai / langgraph

jpuigcerver / kaldi-decoders

mathquis / node-kaldi-online-nnet3-decoder

huggingface / peft

victorfiz / Semantic-VAD

stepfun-ai / Step-Audio-EditX