Stars
Kontra2B / librealsense
Forked from realsenseai/librealsense
Intel® RealSense™ SDK
PantoMatrix: Generating Face and Body Animation from Speech
Implementation of the AES (Advanced Encryption Standard) system as a C program
Code and dataset for photorealistic Codec Avatars driven from audio
AudioLDM training, fine-tuning, evaluation, and inference.
PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML …
A curated list of awesome vision and language resources for earth observation.
A neural network model that detects five different male/female emotions from speech audio. (Deep Learning, NLP, Python)
Core engine for singing voice conversion and singing voice cloning
Realtime Voice Changer
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model
A Python project that uses several standard or otherwise very common libraries to determine the key a song (an .mp3) is in, e.g. F major or C# minor, with annotations and some examples.
A Machine Learning Approach to Emotion Modeling
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Foundational Models for State-of-the-Art Speech and Text Translation
Official PyTorch implementation of Contrastive Learning of Musical Representations
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…
Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".
Pronounced as "musician", musicnn is a set of pre-trained deep convolutional neural networks for music audio tagging.