- Atlanta
Stars
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM …
Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK
Create automatic playlists by using Deep Learning to *listen* to the music.
Code for the paper Hybrid Spectrogram and Waveform Source Separation
A paper list for spatial reasoning
Quick illustration of how one can easily read books together with LLMs. It's great and I highly recommend it.
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
🔊 Text-Prompted Generative Audio Model
Code for "A Computational Analysis of Real-World DJ Mixes using Mix-To-Track Subsequence Alignment" ISMIR 2020
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
NazarPonochevnyi / LSTM-Music-Genre-Classification
Forked from ruohoruotsi/LSTM-Music-Genre-ClassificationMusic genre classification with LSTM Recurrent Neural Nets
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
adefossez / demucs
Forked from facebookresearch/demucsCode for the paper Hybrid Spectrogram and Waveform Source Separation
AutoMashup: an open-source tool for automatic generation of music mashups. Based on open-source deep learning tools.
tmgsr02 / macrobasev
Forked from stanford-futuredata/macrobaseMacroBase: A Search Engine for Fast Data
MacroBase: A Search Engine for Fast Data
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
Muzic: Music Understanding and Generation with Artificial Intelligence
A curated list of awesome things related to shadcn/ui.
StreamingVLM: Real-Time Understanding for Infinite Video Streams
Introduction to Machine Learning Systems