Skip to content
View OLH21's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report OLH21

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A free, open source, and extensible speech-to-text application that works completely offline.

TypeScript 6,840 448 Updated Nov 18, 2025
JavaScript 8 3 Updated Oct 31, 2025

Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.

Swift 891 111 Updated Nov 18, 2025

Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.

Python 10,400 884 Updated Oct 12, 2025

Home Assistant MCP Server

Python 232 32 Updated Aug 5, 2025

🎨 Turn your roughest sketches into stunning 3D worlds by vibe drawing

TypeScript 1,962 283 Updated Jul 3, 2025

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,702 1,165 Updated Nov 14, 2024
Python 14,169 1,369 Updated Nov 18, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 19,466 3,043 Updated Nov 18, 2025

Create images of a given character in different poses

Python 702 86 Updated Jun 5, 2024

Arbitrary-steps Image Super-resolution via Diffusion Inversion (CVPR 2025)

Python 1,309 84 Updated Aug 5, 2025
Python 179 21 Updated Nov 6, 2025

On-device Image Generation for Apple Silicon

Python 665 40 Updated Apr 11, 2025

Independent technology for modern publishing, memberships, subscriptions and newsletters.

JavaScript 51,197 11,164 Updated Nov 18, 2025

Python tool for converting files and office documents to Markdown.

Python 83,086 4,721 Updated Oct 20, 2025

ComfyUI-OmniGen - A ComfyUI custom node implementation of OmniGen, a powerful text-to-image generation and editing model.

Python 296 21 Updated Apr 18, 2025

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …

TypeScript 26,255 2,726 Updated Nov 16, 2025

The desktop app for ComfyUI (Windows & macOS)

TypeScript 1,870 162 Updated Nov 18, 2025

[WACV 2025] Official implementation of "Face Anonymization Made Simple"

Jupyter Notebook 191 19 Updated Jun 25, 2025

Awesome apps, software, and SaaS deals on Black Friday.

6,370 1,392 Updated Nov 18, 2025

ML-powered speech recognition directly in your browser

TypeScript 3,150 401 Updated Oct 1, 2024

A set of ComfyUI nodes for using models served by fal.ai and Replicate.com

Python 37 10 Updated May 1, 2025

Generate accurate transcripts using Apple's MLX framework

Python 442 40 Updated Apr 26, 2025

An extensive node suite for ComfyUI with over 210 new nodes

Jupyter Notebook 1,671 253 Updated Jun 2, 2025

ControlNet++: All-in-one ControlNet for image generations and editing!

Python 2,087 63 Updated Sep 30, 2024

toolkit for interactive exhibitions

Python 292 28 Updated Nov 13, 2025
Next