A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 3,957 291 Updated Jan 5, 2026

ASLP-lab / WenetSpeech-Yue

A Large-scale Cantonese Speech Corpus with Multi-dimensional Annotation

Python 254 11 Updated Nov 30, 2025

freds0 / free-svc

[ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion

Python 89 11 Updated Jul 23, 2025

oomol-lab / pdf-craft

PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.

Python 4,375 278 Updated Jan 1, 2026

EnnengYang / Awesome-Forgetting-in-Deep-Learning

A Comprehensive Survey of Forgetting in Deep Learning Beyond Continual Learning. TPAMI, 2024.

345 18 Updated Jan 2, 2026

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Python 4,616 507 Updated Nov 27, 2025

sahagobinda / SGP

Official [AAAI] Code Repository for "Continual Learning with Scaled Gradient Projection".

Python 15 1 Updated Jun 28, 2023

hiroi-sora / Umi-OCR

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。

Python 41,217 4,087 Updated Nov 20, 2025

HW-whistleblower / True-Story-of-Pangu

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,380 1,344 Updated Jul 9, 2025

Zippland / worth-calculator

Calculating the actual value of your job beyond just salary

TypeScript 2,986 188 Updated Dec 8, 2025

SHI-Labs / CuMo

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

Python 162 8 Updated Jun 8, 2024

zhenye234 / xcodec

AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Python 283 21 Updated Oct 12, 2025

hankcs / HanLP

中文分词词性标注命名实体识别依存句法分析成分句法分析语义依存分析语义角色标注指代消解风格转换语义相似度新词发现关键词短语提取自动摘要文本分类聚类拼音简繁转换自然语言处理

Python 36,055 10,900 Updated Nov 15, 2025

resemble-ai / Resemblyzer

A python package to analyze and compare voices with deep learning

Python 3,202 476 Updated Oct 12, 2023

xingchensong / S3Tokenizer

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python 493 66 Updated Dec 22, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 67,159 12,483 Updated Jan 9, 2026

karpathy / minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 23,263 3,056 Updated Aug 15, 2024

zoubohao / DenoisingDiffusionProbabilityModel-ddpm-

This may be the simplest implement of DDPM. You can directly run Main.py to train the UNet on CIFAR-10 dataset and see the amazing process of denoising.

Python 2,127 221 Updated Apr 24, 2023

hexgrad / kokoro

https://hf.co/hexgrad/Kokoro-82M

JavaScript 5,279 601 Updated Aug 6, 2025

espnet / espnet

End-to-End Speech Processing Toolkit

Python 9,682 2,370 Updated Dec 16, 2025

shibing624 / pycorrector

pycorrector is a toolkit for text error correction. 文本纠错，实现了Kenlm，T5，MacBERT，ChatGLM3，Qwen2.5等模型应用在纠错场景，开箱即用。

Python 6,331 1,162 Updated Jan 6, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dongrui Dongru1

Highlights

Block or report Dongru1

Lists (2)

🔮 Future ideas

✨ Inspiration

Stars

haoheliu / AudioLDM2

hiyouga / EasyR1

unilight / LDNet

sarulab-speech / UTMOSv2

FrontierLabs / F5R-TTS

LingweiMeng / QualifyingExamPreparing

m-bain / whisperX

imxtx / awesome-controllable-speech-synthesis

2noise / ChatTTS

facebookresearch / flow_matching