WangHelin1997

🎯

Focusing

Helin Wang WangHelin1997

A PhD candidate at Johns Hopkins University, interested in AI for Audio & Speech Processing.

233 followers · 66 following

THU & PKU & JHU
Baltimore, US
https://wanghelin1997.github.io/helinwang
in/helin-wang-2a74671b3
https://scholar.google.com/citations?user=I_V0zBMAAAAJ
https://huggingface.co/OpenSound

WangHelin1997/README.md

Hi there 👋

🎓 I'm a PhD student at Johns Hopkins University, where I research speech and audio generation with AI.

🏠 Learn more about my research and projects on my homepage.

🎮 Try live demos of our latest models on OpenSound Spaces.

Pinned

CapSpeech (Public)

CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech

Jupyter Notebook · 365 stars · 41 forks

SoloSpeech (Public)

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline

Python · 279 stars · 31 forks

SSR-Speech (Public)

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis

Python · 141 stars · 16 forks

SoloAudio (Public)

SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer

Python · 106 stars · 12 forks

SpeechTasks (Public)

A list of speech tasks and datasets that can supply training data for generative AI (AIGC), AI model training, intelligent speech tool development, and speech applications.

80 stars · 7 forks

MaskSpec (Public)

The PyTorch implementation of the paper "Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training"

Python · 42 stars · 8 forks