Skip to content
View thuantn210823's full-sized avatar
🌴
On vacation
🌴
On vacation
  • Hanoi University of Science and Technology

Block or report thuantn210823

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Awesome speech/audio LLMs, representation learning, and codec models

1,171 72 Updated Aug 13, 2025

This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which was submitted to ICASSP2022.

Python 99 17 Updated Jun 10, 2022

Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.

Python 119 18 Updated Mar 18, 2023
Python 202 33 Updated Dec 4, 2023
TeX 1 Updated May 12, 2024

An open source dataset for source separation

Python 450 76 Updated Feb 9, 2024

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 8,657 967 Updated Nov 8, 2025

An open source dataset for source separation

Python 6 4 Updated Nov 27, 2021
Python 65 6 Updated Feb 15, 2021

End-to-End Speech Processing Toolkit

Python 9,572 2,343 Updated Nov 5, 2025
C++ 1 Updated Jan 15, 2025
Python 65 3 Updated Feb 8, 2024

A PyTorch implementation of End-to-End Neural Diarization

Python 108 16 Updated Jun 19, 2023

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,879 211 Updated Mar 8, 2024
Python 88 13 Updated Apr 24, 2025

Run TensorFlow on ESP32 chips without pain

C++ 11 4 Updated Dec 27, 2023

PyTorch implementation of YOLO-v1 including training

Shell 165 40 Updated Nov 21, 2022

Conformer: Convolution-augmented Transformer for Speech Recognition

Python 14 2 Updated Sep 4, 2025

End-to-End Neural Diarization

Python 410 64 Updated Aug 30, 2021

Infrastructure to enable deployment of ML models to low-power resource-constrained embedded targets (including microcontrollers and digital signal processors).

C++ 2,590 954 Updated Nov 7, 2025