jake-drysdale

Jake jake-drysdale

Music Tech and Machine Learning - PhD from Birmingham City University - Researcher @Beatoven

74 followers · 46 following

Stars

facebookresearch / perception_models

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,064 138 Updated Dec 18, 2025

KnowledgeXLab / MemVerse

MemVerse: Multimodal Memory for Lifelong Learning Agents

Python 103 4 Updated Jan 6, 2026

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,389 6,674 Updated Jan 9, 2026

NilsDem / control-transfer-diffusion

Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024

Jupyter Notebook 61 9 Updated Feb 19, 2025

mxmxmx / terminal_tedium

eurorack / pi codec

C 232 30 Updated Aug 26, 2019

Chinglohsiu / BandCondiNet

BandCondiNet: Parallel Transformers-based Conditional Popular Music Generation with Multi-View Features

Python 3 2 Updated Oct 18, 2025

carlosholivan / musicaiz

A python framework for symbolic music generation, evaluation and analysis

Python 185 17 Updated Jun 15, 2023

carlosholivan / DeepLearningMusicGeneration

State of the Art of Music Generation with Deep Learning and AI

288 27 Updated Mar 16, 2023

khanld / ASR-Wav2vec-Finetune

⚡ Finetune Wa2vec 2.0 For Speech Recognition

Python 145 32 Updated Feb 6, 2025

fundwotsai2001 / Text-to-music-dataset-preparation

A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]

Python 27 2 Updated May 20, 2025

sony / sampleid

Code for the paper “Automatic Music Sample Identification with Multi-Track Contrastive Learning”.

Python 14 1 Updated Oct 24, 2025

fundwotsai2001 / MuseControlLite

MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners [ICML 2025]

Python 49 7 Updated Jan 6, 2026

logtd / ComfyUI-Fluxtapoz

Nodes for image juxtaposition for Flux in ComfyUI

Python 1,394 56 Updated Jan 9, 2025

SonyResearch / Fx-Encoder_PlusPlus

"Fx-Encoder++: Extracting Instrument-wise Audio Effect Representations from Mixtures"

Python 43 1 Updated Aug 23, 2025

smhongok / inv-dpm

Official repo of On Exact Inversion of DPM-Solvers by Hong et al, in CVPR 2024.

Python 76 1 Updated Jun 11, 2024

declare-lab / jamify

JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment

Python 145 18 Updated Aug 7, 2025

iver56 / torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Python 1,125 99 Updated Nov 24, 2025

dennisvdang / chorus-detection

A deep learning project for automated chorus detection in songs, featuring a command-line interface (CLI) tool that allows users to input a YouTube link and utilize a pre-trained CRNN model to dete…

Jupyter Notebook 46 7 Updated May 21, 2025

heqi201255 / TOMI

AI tool for full-song music production within REAPER digital audio workstation.

Python 9 Updated Aug 9, 2025

TT515 / Florence_Price_Art_Song_Dataset

Digital Catalog of Florence Price's Songs with Metadata

Python 10 Updated Jul 10, 2025

wl-zhao / UniPC

[NeurIPS 2023] UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models

Jupyter Notebook 350 15 Updated Sep 22, 2023

AMAAI-Lab / MelodySim

MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection

Python 16 Updated May 29, 2025

fluxions-ai / vui

Python 635 63 Updated Nov 10, 2025

memvid / memvid

Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval and long-term memory.

Rust 11,831 984 Updated Jan 9, 2026

Paper2Poster / Paper2Poster

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 3,029 206 Updated Dec 21, 2025

ace-step / ACE-Step

ACE-Step: A Step Towards Music Generation Foundation Model

Python 3,593 438 Updated Jun 27, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,908 2,050 Updated Dec 26, 2025

ASLP-lab / DiffRhythm

Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

Python 2,194 255 Updated Nov 27, 2025

hustvl / PixelHacker

PixelHacker: Image Inpainting with Structural and Semantic Consistency

Python 467 19 Updated May 20, 2025

ChaofanTao / Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

777 22 Updated Nov 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly