Stars
JSON parser for streaming objects live from an LLM's output
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
LLM-powered multiagent persona simulation for imagination enhancement and business insights.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Foundational model for human-like, expressive TTS
Inference and training library for high-quality TTS models.
π€ π¬ Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
π Text-Prompted Generative Audio Model
A generative speech model for daily dialogue.
Open Source framework for voice and multimodal conversational AI
Unofficial implementation of the paper "The Chosen One: Consistent Characters in Text-to-Image Diffusion Models"
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Automate browser based workflows with AI
Host WordPress sites on Vercel, Netlify, or AWS Lambda
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
Rapid prototyping of web apps, using a chat interface πΆπ€
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Monitor Memory usage of Python code
A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support
Fast inference engine for Transformer models
Human-readable explanations for what is taxable on your crypto transactions.
Automatically exported from code.google.com/p/m2m-aligner
Self-Supervised Speech Pre-training and Representation Learning Toolkit