TeaPoly

Lucky Wong TeaPoly

Audio and Speech Processing

126 followers · 31 following

DeepSeekV3MoE Public

DeepSeek V3 MoE with aux-loss-free and sequence aux loss.

Python 3 Updated Feb 25, 2025
CE-OptimizedLoss Public

Optimized losses based on cross-entropy (CE), such as MWER (minimum WER) loss with beam search and a negative sampling strategy, and Smoothed Max Pooling loss.

pytorch speech-recognition smp asr mwer paraformer

Python 24 6 Updated Oct 11, 2024
matmulfreellm Public
Forked from ridgerchu/matmulfreellm

Implementation for MatMul-free LM.

Python 2 Apache License 2.0 Updated Jun 27, 2024
wenet Public
Forked from wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python Apache License 2.0 Updated Jun 3, 2024
you-get Public
Forked from soimort/you-get

⏬ Dumb downloader that scrapes the web

Python Other Updated May 10, 2024
NSD-MS2S Public
Forked from liyunlongaaa/NSD-MS2S

CHiME-7 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture

Shell Updated Feb 8, 2024
AIF-PyTorch Public

(NOT Official) Implementation of Auto-regressive Integrate-and-Fire (AIF)

torch cif asr aif auto-regressive-integrate-and-fire

Python 5 2 Updated Dec 18, 2023
CTC-OptimizedLoss Public

Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.

tensorflow pytorch speech-recognition knowledge-distillation mwer ctc-beam-search ctc-mwer

Python 59 11 Updated Sep 6, 2023
PLCPA-ASYM-Loss Public

The power-law compressed phase-aware asymmetric (PLCPA-ASYM) loss

pytorch noise-reduction speech-enhancement

Python 14 1 Apache License 2.0 Updated Sep 4, 2023
self_attention_alignment Public
Forked from lhwcv/self_attention_alignment

Deep model with built-in self-attention alignment for acoustic echo cancellation, PyTorch implementation

Python MIT License Updated May 30, 2023
nara_wpe Public
Forked from fgnt/nara_wpe

Different implementations of "Weighted Prediction Error" for speech dereverberation

Python MIT License Updated Mar 16, 2023
TeaPoly.github.io Public

SCSS 3 Updated Feb 24, 2023
FunASR Public
Forked from modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit

Python MIT License Updated Dec 2, 2022
icefall Public
Forked from k2-fsa/icefall

Python Other Updated Nov 26, 2022
speexdsp-ns-python Public

Python bindings of speexdsp noise suppression library

python noise-reduction speex noise-cancellation speexdsp noise-suppression

C++ 45 5 Apache License 2.0 Updated Nov 18, 2022
Conformer-Athena Public

Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.

tensorflow transformer speech-recognition asr conformer tensorflow2 aishell

Python 44 8 Apache License 2.0 Updated Nov 2, 2022
NKF-AEC Public
Forked from fjiang9/NKF-AEC

Acoustic Echo Cancellation with Neural Kalman Filtering

HTML Updated Sep 13, 2022
torchdistance Public
Forked from francescocastelli/torchdistance

Edit-distance PyTorch extension with CPU and CUDA kernels

Python Updated Apr 6, 2022
asr_frontend Public

PyTorch implementation of ASR frontends, such as PCEN (per-channel energy normalization) and mel-filterbank features.

pytorch mfcc forntend mel-filterbank pcen

Python 3 4 Updated Mar 3, 2022
PercepNet Public
Forked from jzi040941/PercepNet

(Work In Progress) Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech

C++ 1 BSD 3-Clause "New" or "Revised" License Updated Feb 15, 2022
SpectrumAugmenter Public

Performs data augmentation according to the SpecAugment paper. Modified from Lingvo (TensorFlow > 1.10.0).

tensorflow lingvo asr specaugment spectrumaugmenter

Python 1 Updated Jan 26, 2022
rir-configuration-generator Public

Generation of virtual room configurations.

Python 2 Updated Aug 3, 2021
warp-ctc-crf Public

An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for TensorFlow.

tensorflow speech-recognition speech-to-text ctc ctc-crf discriminative-training warp-ctc-crf

Cuda 12 4 Updated Jul 5, 2021
py-aec-unified2021 Public
Forked from echocatzh/py-aec-unified2021

Python Updated Jun 6, 2021
unified2021 Public
Forked from nay0648/unified2021

A Unified Speech Enhancement Front-End for Online Dereverberation, Acoustic Echo Cancellation, and Source Separation

MATLAB 1 Updated Apr 9, 2021
speechbrain Public
Forked from speechbrain/speechbrain

A PyTorch-based Speech Toolkit

Python Apache License 2.0 Updated Mar 17, 2021
warp-rnnt Public
Forked from 1ytic/warp-rnnt

CUDA-Warp RNN-Transducer with TensorFlow and PyTorch binding.

Python 1 MIT License Updated Mar 10, 2021
athena Public
Forked from athena-team/athena

An open-source implementation of a sequence-to-sequence based speech processing engine

Python Apache License 2.0 Updated Nov 12, 2020
SV-GMM Public

Speaker Verification using GMMs

Python 1 Updated Sep 27, 2020
cat_tensorflow Public

CRF-based ASR Toolkit (CAT), TensorFlow implementation

tensorflow speech-recognition speech-to-text asr ctc ctc-crf

Python 8 6 Updated Aug 16, 2020
