- Ztesoft
- Zhengzhou, Henan Province, China
Starred repositories
Long-form streaming TTS system for multi-speaker dialogue generation
An easy implementation of vLLM based on the FireRedASR project
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.
A lightweight SLAM repository for FireredASR-LLM fine-tuning.
GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters
A high-performance REST toolkit written in C++
A reproduction of CT-Transformer for punctuation restoration and disfluency detection.
Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages
We Speech Toolkit, LLM-based Speech Toolkit for Speech Understanding, Generation, and Interaction
A markdown editor that you can deploy on your own servers to achieve cloud storage and device synchronization (cloud-storage, bidirectional-link note-taking software that supports private deployment)
🎉 vue admin,vue3 admin,vue3.0 admin,vue admin dashboard,vue-admin,vue3.0-admin,admin,vue-admin,vue-element-admin,ant-design,vab admin pro,vab admin plus,vue admin plus,vue admin pro
Write scalable load tests in plain Python 🚗💨
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
ibus / ibus
Forked from phuang/ibus
Intelligent Input Bus for Linux/Unix
Voice Activity Detector (VAD): low-latency, high-performance and lightweight
Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection
Low-latency real-time ASR based on FireRedASR
PPOCRLabelv2 is a semi-automatic graphic annotation tool suitable for the OCR field, with a built-in PP-OCR model to automatically detect and re-recognize data.
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
Generate text images for training deep learning OCR models
DeepVoiceGuard is a robust solution for detecting spoofed audio in Automatic Speaker Verification (ASV) systems. This project utilizes the RawNet2 model, trained on the ASVspoof 2019 dataset, and d…