Working on Signal Processing and Machine Learning.
- NVIDIA
- Santa Clara, CA
- spturtle.blogspot.com
Stars
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Open-source reproducible benchmarks from Argmax
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
Simple tutorials using Google's TensorFlow Framework
Speaker diarization based on Kaldi x-vectors, tuned for 16 kHz microphone data