ColeDrain

Ugochukwu Onyebuchi ColeDrain

Software Engineer. MLOps Engineer. AI Engineer. Go. Python. Godot. Tailwind. Figma.

7 followers · 6 following

Indicina
Remote (Lagos)
https://linkedin.com/in/ugochukwu-onyebuchi
@VinciSon
in/ugochukwu-onyebuchi




Stars

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 19,709 1,389 Updated Oct 25, 2025

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,381 3,888 Updated Apr 19, 2025

PaddlePaddle / PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 62,825 9,258 Updated Nov 6, 2025

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 18,890 1,569 Updated Oct 31, 2025

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 18,606 1,972 Updated Oct 21, 2025

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 58,773 9,372 Updated Sep 23, 2025

rednote-hilab / dots.ocr

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 5,594 562 Updated Oct 31, 2025

x1xhlol / system-prompts-and-models-of-ai-tools

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…

94,653 25,535 Updated Nov 1, 2025

tesseract-ocr / tesseract

Tesseract Open Source OCR Engine (main repository)

C++ 70,736 10,360 Updated Oct 13, 2025

SkyworkAI / SkyReels-V2

SkyReels-V2: Infinite-length Film Generative model

Python 4,897 693 Updated Aug 11, 2025

huggingface / speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 4,225 485 Updated Apr 15, 2025

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 3,089 215 Updated May 19, 2025

lharries / whatsapp-mcp

WhatsApp MCP server

Go 5,044 776 Updated Jul 13, 2025

Goldziher / kreuzberg

Document intelligence framework for Python - Extract text, metadata, and structured data from PDFs, images, Office documents, and more. Built on Pandoc, PDFium, and Tesseract.

HTML 2,490 109 Updated Nov 6, 2025

kluctl / go-embed-python

A library that provides an embedded python distribution to be usable from inside golang

Go 320 30 Updated Jan 2, 2025

Lightning-AI / LitServe

Build custom inference engines for models, agents, multi-modal systems, RAG, pipelines and more.

Python 3,683 254 Updated Nov 4, 2025

aidotse / docling-inference

API service for docling document conversion

Python 37 9 Updated Feb 20, 2025

docling-project / docling-serve

Running Docling as an API service

Python 907 202 Updated Oct 31, 2025

docling-project / docling

Get your documents ready for gen AI

Python 43,130 3,088 Updated Nov 6, 2025

coqui-ai / open-bible-scripts

Scripts for working with open.bible data

Shell 25 14 Updated Jan 24, 2022

rioharper / VocalForge

Your one-stop solution for voice dataset creation

Python 127 24 Updated Dec 10, 2023

ColeDrain / chatshell

ChatShell is a productivity tool for the command-line, powered by OpenAI's GPT-3 language model. It helps users find shell commands quickly and easily, reducing the need to search online and improv…

Go 2 Updated Mar 25, 2023

idiap / coqui-ai-TTS

Forked from coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 1,962 244 Updated Oct 16, 2025


Lists (3)

Docling Ecosystem

🚀 My stack

TTS Repos
