Skip to content
View woojeonghippo's full-sized avatar

Block or report woojeonghippo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Audio Large Language Models

Python 848 43 Updated Jul 5, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 3,991 291 Updated Jan 5, 2026

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…

C 1,176 211 Updated Dec 17, 2025

A nearly-live implementation of OpenAI's Whisper.

Python 3,732 512 Updated Jan 13, 2026

A PyTorch native platform for training generative AI models

Python 4,964 667 Updated Jan 15, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,174 1,136 Updated Jan 15, 2026

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,877 345 Updated Jan 4, 2024

Examples in the MLX framework

Python 8,130 1,122 Updated Dec 15, 2025

Vector (and Scalar) Quantization, in Pytorch

Python 3,822 313 Updated Jan 13, 2026

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 19,638 2,107 Updated Oct 21, 2025

A nanoGPT pipeline packed in a spreadsheet

2,142 129 Updated Jun 17, 2024