📎
Clippy clips back!
ml
36 repositories
QLoRA: Efficient Finetuning of Quantized LLMs
Universal LLM Deployment Engine with ML Compilation
A simple LLM chat front-end that makes it easy to find, download, and mess around with models on your local machine.
Distribute and run LLMs with a single file.
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.