ruclion

Follow

户建坤 ruclion

Follow

喜欢威斯布鲁克和米切尔=.=

49 followers · 64 following

Tsinghua University
深圳
https://blog.csdn.net/u013625492

Achievements

Achievements

Stars

Daisyqk / Automatic-Prosody-Annotation

Python 111 52 Updated Apr 6, 2022

stefanrmmr / streamlit-audio-recorder

Record Audio from the User's Microphone in Apps that are Deployed to the Web. (via Browser Media-API, REACT-based, Streamlit Custom Component)

TypeScript 494 98 Updated Sep 11, 2023

yatengLG / Focal-Loss-Pytorch

全中文注释.(The loss function of retinanet based on pytorch).(You can use it on one-stage detection task or classifical task, to solve data imbalance influence).用于one-stage目标检测算法,提升检测效果.你也可以在分类任务中使用该损失函…

Jupyter Notebook 491 112 Updated Oct 9, 2025

lrfasd / lrfasd.github.io

HTML 46 18 Updated Oct 10, 2025

nladuo / AI_beatmap_generator

尝试使用神经网络生成音乐游戏Malody的谱面。

Jupyter Notebook 51 13 Updated Feb 19, 2020

rogersce / cnpy

library to read/write .npy and .npz files in C/C++

C++ 1,440 326 Updated Jan 18, 2023

nttcslab-sp / kaldiio

A pure python module for reading and writing kaldi ark files

Python 267 37 Updated Mar 6, 2025

tensorflow / tensorflow

An Open Source Machine Learning Framework for Everyone

C++ 192,591 75,002 Updated Nov 28, 2025

serizba / cppflow

Run TensorFlow models in C++ without installation and without Bazel

C++ 809 181 Updated Aug 16, 2024

jtkim-kaist / ram_modified

"Recurrent Models of Visual Attention" in TensorFlow

Python 41 9 Updated Apr 13, 2017

srvk / DiViMe

ACLEW Diarization Virtual Machine

Shell 34 9 Updated Jul 29, 2019

jtkim-kaist / Speech-enhancement

Deep neural network based speech enhancement toolkit

MATLAB 217 62 Updated Jun 14, 2019

jtkim-kaist / VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

MATLAB 866 234 Updated Jun 9, 2021

zhaoyi2 / audio_augment

A tool/script for batch speech data enhancement with speed/volume/RIRS/MUSAN

Shell 24 5 Updated Jun 28, 2020

shamim-hussain / musan_investigation_cnn_rnn

Evaluation of the classification performance (Speech, Music, and Noise) of 1D (WaveNet) and 2D (MobileNet) CNN and RNN (GRU) on the MUSAN corpus.

Python 14 10 Updated Sep 23, 2020

usc-sail / mica-speech-activity-detection

Robust Speech Activity Detection (SAD) in movie audio

Python 26 10 Updated Jan 27, 2021

RicherMans / Datadriven-GPVAD

The codebase for Data-driven general-purpose voice activity detection.

Python 94 23 Updated Aug 3, 2023

RicherMans / GPV

Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper

Python 141 29 Updated Aug 3, 2023

iiscleap / DIHARD_2019_baseline_alltracks

Perl 38 12 Updated May 16, 2022

nryant / dscore

Diarization scoring tools.

Python 260 46 Updated Mar 28, 2023

wiseman / py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

C 2,405 424 Updated Jul 4, 2024

hbredin / DomainAdversarialVoiceActivityDetection

Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"

Jupyter Notebook 23 4 Updated Mar 3, 2020

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 7,466 678 Updated Nov 25, 2025

tjdgns0928 / MultiTarget_VAD

Representation of Paper: On training targets for noise-robust voice activity detection.

Jupyter Notebook 5 2 Updated Jun 17, 2021

qiuqiangkong / panns_transfer_to_gtzan

Python 111 45 Updated Jul 12, 2020

YuanGongND / psla

Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".

Python 149 15 Updated Jul 13, 2023

YuanGongND / ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,386 237 Updated May 21, 2023

jim-schwoebel / voicebook

🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).

Python 387 88 Updated Dec 8, 2022

jim-schwoebel / sound_event_detection

🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.

Python 46 3 Updated Feb 20, 2022

jim-schwoebel / audioset_models

📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).

Python 31 12 Updated Jun 17, 2024