Stars
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
Noise supression using deep filtering
Task 4 Large-scale weakly supervised sound event detection for smart cars
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
A collection of resources to make a smart speaker
Production First and Production Ready End-to-End Speech Recognition Toolkit
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小时), 所以识别效果也很好,可以媲美许多商用的ASR软件。
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Chinese text normalization for speech processing
TDHS (time domain harmonic scaling) library with command-line demo
A desktop app for inspecting your React JS and React Native projects. macOS, Linux, and Windows.
A Desktop port of React Native, driven by Qt, forked from Canonical
Asimple closed loop oscillatory feedback detector and suppressor based on Spectral Flatness Measure.
一步一步编写web3工具——Step-by-Step Development of Web3 Tools
A python interface for interacting with the Ethereum blockchain and ecosystem.
HODL CLUB/囤币党社区 - 致力于做一颗传播比特币&以太坊囤币思想的火种,共同提高认知水平,拥有健康富足心态,走向共同富裕之路!陆续整理发布微博KOL囤币信仰大V比如ahr999九神微博精选文章,另九神历年微博2014~2021合集2990条珍藏版已发布,欢迎下载及分享给亲朋好友!