-
Shanghai University
- Shanghai, China
-
19:02
(UTC +08:00)
Highlights
- Pro
Stars
Self-supervised learning for real-time pitch estimation
Muzic: Music Understanding and Generation with Artificial Intelligence
Symbolic Music Generation with Diffusion Models
A chord identifier and harmonizer for MIDI files
Official code for Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion (ICML 2024, Oral).
A large-scale dataset of caption-annotated MIDI files.
Text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language models and a powerful autoregressive transformer decoder, text2m…
Official PyTorch implementation of BigVGAN (ICLR 2023)
Algorithm and Data for paper "Automatic Detection of Hierarchical Structure and Influence of Structure on Melody, Harmony and Rhythm in Popular Music"
Official implementation of compound word transformer (AAAI'21)
Beginner Tutorial to EC2-VAE and Poly-Dis. Designed for ICM.
The repository of the paper: Wang et al., Learning interpretable representation for controllable polyphonic music generation, ISMIR 2020.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
This is the dataset repository for the paper: POP909: A Pop-song Dataset for Music Arrangement Generation
AudioLDM: Generate speech, sound effects, music and beyond, with text.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Pytorch implementation of MeanFlow on ImageNet and CIFAR10
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
[Pattern Recognition 25] CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
This is the official implementation of MusER (AAAI'24).
Stable Diffusion web UI