Skip to content
View shylockasr's full-sized avatar
🍑
Re-thinking
🍑
Re-thinking

Block or report shylockasr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

HunyuanVideo-1.5: A leading lightweight video generation model

Python 704 58 Updated Nov 26, 2025

A small speech recognizer

C 4,228 728 Updated Nov 24, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 64,028 11,552 Updated Nov 26, 2025

A list of publically available audio data that anyone can download for ASR or other speech activities

Shell 231 22 Updated Aug 6, 2021

Build resilient language agents as graphs.

Python 21,451 3,781 Updated Nov 26, 2025

Custom decoders for Kaldi

C++ 80 26 Updated Jun 10, 2019

ASR online decoding using Kaldi NNet3 GrammarFST

C++ 8 4 Updated Jul 11, 2023

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,140 2,113 Updated Nov 21, 2025

Semantic Voice Activity Detection adds an lightweight LLM prediction model to continuously evaluate whether a user has really finished speaking.

Python 11 2 Updated Aug 19, 2025

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech

Python 697 45 Updated Nov 25, 2025
Python 8 Updated Sep 16, 2024

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 581 49 Updated Oct 29, 2025

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,151 142 Updated Sep 5, 2024

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 422 23 Updated Nov 25, 2025

Official electron build of draw.io

JavaScript 58,074 5,508 Updated Nov 17, 2025

Spark-TTS Inference Code

Python 7 Updated Aug 19, 2025

Spark-TTS Inference Code

Python 10,742 1,146 Updated Apr 9, 2025

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,188 332 Updated Sep 10, 2025

Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯

Python 876 42 Updated Oct 28, 2025

pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。

Python 6,270 1,155 Updated Nov 20, 2025

一个面向中文文本纠错任务的综合平台,集学术研究、模型训练、模型评测和推理部署于一体,覆盖拼写纠错与语法纠错两个核心方向。

Python 437 35 Updated Nov 26, 2025

中文文本纠错相关的论文、比赛和工具。

68 5 Updated Sep 16, 2025

The Swift Programming Language

C++ 69,365 10,605 Updated Nov 26, 2025
Python 5 2 Updated May 21, 2025

Code for Re-evaluating Minimum Bayes Risk Decoding for Automatic Speech Recognition

Python 5 4 Updated Oct 28, 2025
Python 1 Updated Jul 14, 2025

LLM-based ASR recipe with Zipformer encoder and Qwen LLM

Python 18 3 Updated Sep 25, 2025

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 12,387 1,948 Updated Oct 20, 2025

结巴中文分词

Python 34,595 6,736 Updated Aug 21, 2024

百度NLP:分词,词性标注,命名实体识别,词重要性

C++ 3,972 594 Updated May 25, 2021
Next