Starred repositories

exercise for nndl

Jupyter Notebook 3,306 1,463 Updated Jul 19, 2024

"Neural Network and Deep Learning" (《神经网络与深度学习》), by Xipeng Qiu

HTML 18,604 3,671 Updated Oct 7, 2022

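Xmart Youth Forum repository, holding video replays and lecture notes from past student forums and frontier lectures. For the latest Xmart announcements, follow the official account "XLANCE Lab".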

28 Updated Oct 27, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 13,384 1,355 Updated Oct 1, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 17,195 1,883 Updated Oct 21, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,371 5,749 Updated Aug 16, 2024

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 2,547 228 Updated Oct 30, 2025

Keyword spotting on Arm Cortex-M Microcontrollers

C 1,209 426 Updated Apr 10, 2019

Awesome speech/audio LLMs, representation learning, and codec models

1,171 72 Updated Aug 13, 2025

Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.

187 14 Updated Nov 10, 2024

Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.

1 Updated Dec 11, 2024

This is a speech analysis, modification and synthesis system

MATLAB 53 28 Updated Oct 18, 2021

Di♪♪Rhythm 2: Efficient And High Fidelity Song Generation Via Block Flow Matching

Python 82 2 Updated Nov 9, 2025

A library for speech data augmentation in time-domain

Python 677 59 Updated Aug 30, 2021

Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

TypeScript 11,257 1,128 Updated Nov 9, 2025

Examples of my Claude Code infrastructure with skill auto-activation, hooks, and agents

Shell 5,398 683 Updated Oct 31, 2025

12 Lessons to Get Started Building AI Agents

Jupyter Notebook 44,217 14,968 Updated Nov 7, 2025

Microsoft AI

Python 2,163 602 Updated May 10, 2025

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

862 81 Updated Jul 8, 2025

Join the community on Discord for more discussions around Neutone! https://discord.gg/VHSMzb8Wqp

Python 564 29 Updated Nov 2, 2025

Deezer source separation library including pretrained models.

Python 27,734 3,048 Updated Apr 2, 2025

Code for the blog "Neural audio codecs: how to get audio into LLMs"

Python 129 3 Updated Oct 20, 2025

Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation

Jupyter Notebook 192 14 Updated Jul 29, 2025

Speech recognition

C 1,162 171 Updated Oct 29, 2025

Chinese translation of Hands-On-Large-Language-Models (hands-on-llms), a hands-on guide to learning large language models

Jupyter Notebook 1,644 177 Updated Oct 19, 2025

implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain

Python 48 7 Updated Nov 4, 2020

We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction

Python 143 9 Updated Nov 5, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 78,285 11,569 Updated Nov 9, 2025

Chinese translation of the LLMs-from-scratch project

Jupyter Notebook 1,934 310 Updated Oct 15, 2025