Skip to content
View LoyiAkira's full-sized avatar
  • Shanghai University
  • Shanghai, China
  • 19:02 (UTC +08:00)

Highlights

  • Pro

Block or report LoyiAkira

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Self-supervised learning for real-time pitch estimation

Python 258 22 Updated Oct 15, 2025

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,856 488 Updated Oct 12, 2024
Python 9 2 Updated Apr 15, 2024
Python 2 Updated Jan 6, 2025

Symbolic Music Generation with Diffusion Models

Python 265 36 Updated Aug 22, 2025

A chord identifier and harmonizer for MIDI files

Python 97 8 Updated Dec 4, 2020

Official code for Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion (ICML 2024, Oral).

Python 84 8 Updated Aug 12, 2024

A large-scale dataset of caption-annotated MIDI files.

Python 75 3 Updated Jul 23, 2024

Text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language models and a powerful autoregressive transformer decoder, text2m…

Python 121 15 Updated Feb 28, 2025

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,118 141 Updated Sep 5, 2024

Algorithm and Data for paper "Automatic Detection of Hierarchical Structure and Influence of Structure on Melody, Harmony and Rhythm in Popular Music"

Python 98 11 Updated Oct 5, 2022

Official implementation of compound word transformer (AAAI'21)

Python 278 44 Updated Nov 27, 2023

Beginner Tutorial to EC2-VAE and Poly-Dis. Designed for ICM.

Jupyter Notebook 15 3 Updated Nov 17, 2021

The repository of the paper: Wang et al., Learning interpretable representation for controllable polyphonic music generation, ISMIR 2020.

Python 43 12 Updated Mar 22, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,301 6,426 Updated Oct 18, 2025
Python 47 3 Updated Apr 3, 2024
Python 5 Updated May 26, 2025

This is the dataset repository for the paper: POP909: A Pop-song Dataset for Music Arrangement Generation

Python 347 44 Updated Aug 28, 2020

Text-to-Audio/Music Generation

Python 2,506 203 Updated Sep 29, 2024

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,753 249 Updated Jun 25, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,172 1,098 Updated Aug 27, 2025

Pytorch implementation of MeanFlow on ImageNet and CIFAR10

Python 306 17 Updated Aug 23, 2025

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Python 988 58 Updated Mar 12, 2024

[Pattern Recognition 25] CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks

Jupyter Notebook 445 26 Updated Mar 1, 2025

只需几步即可学习构建一款软件

Python 1,210 52 Updated Sep 26, 2025

AI-based Audio Watermarking Tool

Python 286 38 Updated Jan 7, 2024

This is the official implementation of MusER (AAAI'24).

Python 30 1 Updated Jun 4, 2025

Stable Diffusion web UI

Python 157,331 29,207 Updated Oct 7, 2025
Next