Skip to content
View fynnomenon's full-sized avatar

Highlights

  • Pro

Organizations

@AI-in-Practice-UOS @deeplair-dev

Block or report fynnomenon

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Vogent Turn: fast, open-source turn-detection for Voice AI applications

Python 38 2 Updated Oct 28, 2025

SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.

Python 2,331 272 Updated Nov 12, 2025

Long-form streaming TTS system for multi-speaker dialogue generation

Python 1,232 111 Updated Oct 26, 2025

Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models

Python 199 13 Updated Nov 12, 2025

A Conversational Speech Generation Model

Python 14,314 1,450 Updated May 27, 2025

Create Videos with Code

TypeScript 3,532 164 Updated May 9, 2025
Python 313 19 Updated Aug 28, 2025

Mellea is a library for writing generative programs.

Python 236 53 Updated Nov 24, 2025

Simple MCP-UI widgets for common use cases.

HTML 52 6 Updated Aug 26, 2025

AG-UI: the Agent-User Interaction Protocol. Bring Agents into Frontend Applications.

TypeScript 9,960 921 Updated Nov 26, 2025

🤖 WebMCP 🧪

379 18 Updated Nov 17, 2025

UI over MCP. Create next-gen UI experiences with the protocol and SDK!

TypeScript 3,610 255 Updated Nov 24, 2025

Example apps for the Apps SDK

JavaScript 1,722 361 Updated Nov 20, 2025

On-device TTS model by Neuphonic

Python 4,080 412 Updated Nov 18, 2025

Bringing the power of MCP to the web

TypeScript 937 63 Updated Oct 7, 2025

MiMo-Audio: Audio Language Models are Few-Shot Learners

Python 864 86 Updated Sep 20, 2025

Build an email assistant with human-in-the-loop and memory

Jupyter Notebook 1,375 293 Updated Oct 20, 2025

Chat with your Letta agents over a low-latency voice connection. Advanced voice mode, but with advanced memory.

Python 17 10 Updated Jun 11, 2025

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 9,017 1,000 Updated Nov 26, 2025

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 1,177 164 Updated Nov 24, 2025

Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.

Python 1,235 91 Updated Sep 22, 2025

State-of-the-art TTS model under 25MB 😻

Python 9,126 459 Updated Aug 23, 2025

SOTA Open Source TTS

Python 24,180 1,974 Updated Nov 6, 2025

Interface for OuteTTS models.

Python 1,408 114 Updated Jun 21, 2025
Python 2,629 332 Updated Nov 25, 2025

Fast and local neural text-to-speech engine

C++ 1,768 183 Updated Nov 12, 2025

Flexible and powerful framework for managing multiple AI agents and handling complex conversations

Python 7,093 651 Updated Nov 19, 2025

Next Generation Agentic Proxy for AI Agents and MCP servers

Rust 1,370 205 Updated Nov 26, 2025

A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.

Jupyter Notebook 36 2 Updated Nov 8, 2025
Next