-
aasist Public
Forked from clovaai/aasistOfficial PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"
Python MIT License UpdatedFeb 11, 2025 -
NeMo Public
Forked from NVIDIA-NeMo/NeMoNeMo: a toolkit for conversational AI
Python Apache License 2.0 UpdatedNov 10, 2024 -
AudioLDM2 Public
Forked from haoheliu/AudioLDM2Text-to-Audio/Music Generation
Python Other UpdatedSep 29, 2024 -
SSL_Anti-spoofing Public
Forked from TakHemlata/SSL_Anti-spoofingThis repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".
Python MIT License UpdatedAug 23, 2024 -
-
s3prl Public
Forked from s3prl/s3prlSelf-Supervised Speech Pre-training and Representation Learning Toolkit
Python Apache License 2.0 UpdatedDec 29, 2023 -
ijepa Public
Forked from facebookresearch/ijepaOfficial codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…
Python Other UpdatedOct 14, 2023 -
VALL-E-X Public
Forked from Plachtaa/VALL-E-XAn open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
Python MIT License UpdatedOct 5, 2023 -
ToMe Public
Forked from facebookresearch/ToMeA method to increase the speed and lower the memory footprint of existing vision transformers.
Python Other UpdatedSep 5, 2023 -
vall-e Public
Forked from lifeiteng/vall-ePyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Python Apache License 2.0 UpdatedAug 8, 2023 -
vicreg Public
Forked from facebookresearch/vicregVICReg official code base
Python MIT License UpdatedJul 6, 2023