Skip to content
View jake-drysdale's full-sized avatar

Block or report jake-drysdale

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,064 138 Updated Dec 18, 2025

MemVerse: Multimodal Memory for Lifelong Learning Agents

Python 103 4 Updated Jan 6, 2026

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,389 6,674 Updated Jan 9, 2026

Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024

Jupyter Notebook 61 9 Updated Feb 19, 2025

eurorack / pi codec

C 232 30 Updated Aug 26, 2019

BandCondiNet: Parallel Transformers-based Conditional Popular Music Generation with Multi-View Features

Python 3 2 Updated Oct 18, 2025

A python framework for symbolic music generation, evaluation and analysis

Python 185 17 Updated Jun 15, 2023

State of the Art of Music Generation with Deep Learning and AI

288 27 Updated Mar 16, 2023

⚡ Finetune Wa2vec 2.0 For Speech Recognition

Python 145 32 Updated Feb 6, 2025

A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]

Python 27 2 Updated May 20, 2025

Code for the paper “Automatic Music Sample Identification with Multi-Track Contrastive Learning”.

Python 14 1 Updated Oct 24, 2025

MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners [ICML 2025]

Python 49 7 Updated Jan 6, 2026

Nodes for image juxtaposition for Flux in ComfyUI

Python 1,394 56 Updated Jan 9, 2025

"Fx-Encoder++: Extracting Instrument-wise Audio Effect Representations from Mixtures"

Python 43 1 Updated Aug 23, 2025

Official repo of On Exact Inversion of DPM-Solvers by Hong et al, in CVPR 2024.

Python 76 1 Updated Jun 11, 2024

JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment

Python 145 18 Updated Aug 7, 2025

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Python 1,125 99 Updated Nov 24, 2025

A deep learning project for automated chorus detection in songs, featuring a command-line interface (CLI) tool that allows users to input a YouTube link and utilize a pre-trained CRNN model to dete…

Jupyter Notebook 46 7 Updated May 21, 2025

AI tool for full-song music production within REAPER digital audio workstation.

Python 9 Updated Aug 9, 2025

Digital Catalog of Florence Price's Songs with Metadata

Python 10 Updated Jul 10, 2025

[NeurIPS 2023] UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models

Jupyter Notebook 350 15 Updated Sep 22, 2023

MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection

Python 16 Updated May 29, 2025
Python 635 63 Updated Nov 10, 2025

Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval and long-term memory.

Rust 11,831 984 Updated Jan 9, 2026

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 3,029 206 Updated Dec 21, 2025

ACE-Step: A Step Towards Music Generation Foundation Model

Python 3,593 438 Updated Jun 27, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,908 2,050 Updated Dec 26, 2025

Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

Python 2,194 255 Updated Nov 27, 2025

PixelHacker: Image Inpainting with Structural and Semantic Consistency

Python 467 19 Updated May 20, 2025

[TMLR 2025🔥] A survey for the autoregressive models in vision.

777 22 Updated Nov 8, 2025
Next