Skip to content
View zhiyuan081632's full-sized avatar

Block or report zhiyuan081632

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Python 1,312 440 Updated Jul 25, 2024

Noise supression using deep filtering

Python 3,493 344 Updated Oct 17, 2024
Python 2 1 Updated Jul 6, 2022

Task 4 Large-scale weakly supervised sound event detection for smart cars

Python 67 32 Updated Dec 20, 2021

A handy dataset of noises for ASR

22 7 Updated May 29, 2019

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 13,459 2,102 Updated Nov 6, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 13,407 1,357 Updated Oct 1, 2025

A collection of resources to make a smart speaker

470 94 Updated Dec 20, 2019

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,897 1,165 Updated Nov 3, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,341 309 Updated Jun 21, 2025

这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小时), 所以识别效果也很好,可以媲美许多商用的ASR软件。

C 543 78 Updated Mar 19, 2023

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,924 146 Updated Apr 21, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 3,610 291 Updated Aug 14, 2025

Port of the OpenFST library to Windows

C++ 78 43 Updated Apr 23, 2024

Chinese text normalization for speech processing

Python 712 149 Updated Mar 18, 2023

TDHS (time domain harmonic scaling) library with command-line demo

C 65 10 Updated May 2, 2023
C 232 55 Updated Nov 27, 2023

webrtc audio processing

C++ 409 142 Updated May 10, 2020

整理出来的webrtc波束模块

C++ 39 21 Updated Apr 7, 2021

A desktop app for inspecting your React JS and React Native projects. macOS, Linux, and Windows.

TypeScript 15,408 962 Updated Oct 13, 2025

A Desktop port of React Native, driven by Qt, forked from Canonical

JavaScript 1,238 85 Updated Apr 29, 2021

Asimple closed loop oscillatory feedback detector and suppressor based on Spectral Flatness Measure.

C 16 3 Updated Mar 30, 2020

Stack trace visualizer

Perl 18,913 2,059 Updated Oct 20, 2024

Compiler for Neural Network hardware accelerators

C++ 3,315 701 Updated May 11, 2024

👁‍🗨 Rare and exotic sats

Rust 3,939 1,462 Updated Oct 30, 2025

一步一步编写web3工具——Step-by-Step Development of Web3 Tools

C 264 103 Updated Nov 4, 2025

A python interface for interacting with the Ethereum blockchain and ecosystem.

Python 5,424 1,834 Updated Nov 5, 2025

HODL CLUB/囤币党社区 - 致力于做一颗传播比特币&以太坊囤币思想的火种,共同提高认知水平,拥有健康富足心态,走向共同富裕之路!陆续整理发布微博KOL囤币信仰大V比如ahr999九神微博精选文章,另九神历年微博2014~2021合集2990条珍藏版已发布,欢迎下载及分享给亲朋好友!

113 9 Updated Oct 25, 2021
Next