Skip to content
View lixikun's full-sized avatar
  • Ztesoft
  • zhengzhou,henan Provice,China

Block or report lixikun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Long-form streaming TTS system for multi-speaker dialogue generation

Python 1,314 122 Updated Oct 26, 2025

An easy implementation of vLLM based on the FireRedASR project

Python 13 Updated Sep 12, 2025

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 1,381 93 Updated Jan 9, 2026
Python 5 Updated Apr 9, 2025

Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.

Python 677 52 Updated Jan 8, 2026

A lightweight SLAM repository for FireredASR-LLM fine-tuning.

Python 4 1 Updated Oct 21, 2025

Fast C++ logging library.

C++ 28,094 5,012 Updated Jan 10, 2026

GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters

Python 670 58 Updated Dec 30, 2025

A high-performance REST toolkit written in C++

C++ 3,433 719 Updated Dec 15, 2025

A reproduction of CT-Transformer for punctuation restoration and disfluency detection.

C++ 2 1 Updated Oct 8, 2024

index-tts to OpenAI API server

Python 5 1 Updated May 30, 2025

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,572 219 Updated Dec 30, 2025

We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction

Python 172 13 Updated Jan 7, 2026

A markdown editor that you can deploy on your own servers to achieve cloud storage and device synchronization(支持私有部署的云端存储双链笔记软件)

Java 3,771 340 Updated Dec 23, 2025

一款私有云笔记,git + markdown

Java 257 52 Updated May 23, 2023

🎉 vue admin,vue3 admin,vue3.0 admin,vue后台管理,vue-admin,vue3.0-admin,admin,vue-admin,vue-element-admin,ant-design,vab admin pro,vab admin plus,vue admin plus,vue admin pro

Vue 18,598 3,920 Updated Jan 9, 2026

Write scalable load tests in plain Python 🚗💨

Python 27,316 3,161 Updated Jan 8, 2026
Python 673 68 Updated Dec 30, 2025

Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching

Python 762 107 Updated Dec 2, 2025

Intelligent Input Bus for Linux/Unix

C 952 197 Updated Jan 9, 2026

Voice Activity Detector (VAD) : low-latency, high-performance and lightweight

C 1,885 148 Updated Dec 23, 2025

Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection

Python 53 9 Updated Dec 4, 2024

low-latency realtime ASR based on FireRedASR

Python 54 11 Updated Jul 8, 2025

PPOCRLabelv2 is a semi-automatic graphic annotation tool suitable for OCR field, with built-in PP-OCR model to automatically detect and re-recognize data.

Python 371 93 Updated Oct 14, 2025

ㄓ rime for python 🐍️

Python 8 2 Updated Jan 5, 2026

Python Wrapper for RnNoise v0.2

Python 73 14 Updated Dec 18, 2025

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 8,530 717 Updated Dec 17, 2025

Generate text images for training deep learning ocr model

Python 1,462 387 Updated Jan 17, 2022

DeepVoiceGuard is a robust solution for detecting spoofed audio in Automatic Speaker Verification (ASV) systems. This project utilizes the RawNet2 model, trained on the ASVspoof 2019 dataset, and d…

Python 3 Updated Jan 13, 2025
Next