Skip to content
View hcynomo's full-sized avatar

Block or report hcynomo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Intel® RealSense™ SDK

C++ 1 Updated Jan 8, 2026

Linux kernel source tree

C 213,754 59,918 Updated Jan 12, 2026

drawYUV

JavaScript 3 Updated Mar 22, 2024

UFO³: Weaving the Digital Agent Galaxy

Python 7,937 971 Updated Jan 6, 2026

PantoMatrix: Generating Face and Body Animation from Speech

Python 1,165 185 Updated Jan 16, 2025

Implement AES(Advanced Encryption Standard) Stystem in C program

C 9 5 Updated May 28, 2019

Code and dataset for photorealistic Codec Avatars driven from audio

Python 2,852 281 Updated Sep 15, 2024

AudioLDM training, finetuning, evaluation and inference.

Python 290 57 Updated Dec 13, 2024

Text-to-Audio/Music Generation

Python 2,557 203 Updated Sep 29, 2024

PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML …

Python 447 43 Updated Feb 26, 2024

A curated list of awesome vision and language resources for earth observation.

252 19 Updated Mar 23, 2025

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

Jupyter Notebook 1,397 438 Updated Feb 7, 2023

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 2,845 923 Updated Apr 23, 2024

リアルタイムボイスチェンジャー Realtime Voice Changer

Python 19,484 2,217 Updated Aug 24, 2025

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,513 524 Updated Jun 13, 2025

Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,607 359 Updated May 13, 2025

A python project that uses several standard/otherwise very common libraries to determine the key that a song (an .mp3) is in, i.e. F major or C# minor, with annotations and some examples.

Jupyter Notebook 177 25 Updated Jul 18, 2020

A Machine Learning Approach of Emotional Model

Python 246 63 Updated Aug 5, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 44,211 5,908 Updated Aug 16, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,947 2,688 Updated Dec 15, 2025

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,963 783 Updated Feb 11, 2024

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 59,194 9,411 Updated Dec 15, 2025

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,736 1,170 Updated Nov 14, 2024

Official PyTorch implementation of Contrastive Learning of Musical Representations

Python 335 51 Updated Jul 25, 2024

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,144 2,678 Updated Nov 3, 2025

Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".

Python 422 27 Updated May 25, 2025

Traditional Mandarin LLMs for Taiwan

Python 1,386 116 Updated Apr 20, 2025

Pronounced as "musician", musicnn is a set of pre-trained deep convolutional neural networks for music audio tagging.

Jupyter Notebook 669 101 Updated Dec 11, 2023
Next