Lists (1)
Sort Name ascending (A-Z)
Starred repositories
💫 Toolkit to help you get started with Spec-Driven Development
A self hosted virtual browser that runs in docker and uses WebRTC.
Tool to make high quality text to speech (tts) corpus from audio + text books.
Minimal reproduction of DeepSeek R1-Zero
📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools lik…
Automagically reverse-engineer REST APIs via capturing traffic
Convert PDF to markdown + JSON quickly with high accuracy
verl: Volcano Engine Reinforcement Learning for LLMs
The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integra…
A de-minifier (formatter, exploder, beautifier) for shell one-liners
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Phoneme alignment representation compatible with multiple forced aligners
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-to-Speech
Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voic…
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Label Studio is a multi-type data labeling and annotation tool with standardized output format
MARS5 speech model (TTS) from CAMB.AI
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Downloadable snapshots of the Chrome Top Million Websites pulled from public CrUX data in Google BigQuery.
Fabric is an open-source framework for augmenting humans using AI. It provides a modular system for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
OpenAPI linting, diffing and testing. Optic helps prevent breaking changes, publish accurate documentation and improve the design of your APIs.
A massively parallel, high-level programming language