Tim von Känel dunky11

🚀

I like speech synthesis and I like to hoard data.

181 followers · 60 following

Achievements

x3 x3

Stars

rom1504 / img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 4,223 365 Updated Oct 19, 2025

Picovoice / cobra

On-device voice activity detection (VAD) powered by deep learning

Python 233 15 Updated Nov 19, 2025

cdown / srt

A simple library and set of tools for parsing, modifying, and composing SRT files.

Python 526 49 Updated Mar 19, 2024

getomni-ai / zerox

OCR & Document Extraction using vision models

TypeScript 11,969 820 Updated May 20, 2025

elevenlabs / elevenlabs-mcp

The official ElevenLabs MCP server

Python 1,071 183 Updated Nov 17, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,882 905 Updated Sep 30, 2025

MahmoudAshraf97 / ctc-forced-aligner

Text to speech alignment using CTC forced alignment

Python 391 71 Updated Aug 13, 2025

Tencent-Hunyuan / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,327 1,135 Updated Nov 21, 2025

bshall / dusted

DUSTED: Spoken-Term Discovery using Discrete Speech Units

Jupyter Notebook 18 Updated Oct 2, 2024

allenai / peS2o

Pretraining Efficiently on S2ORC!

Python 173 6 Updated Oct 23, 2024

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,116 826 Updated Nov 20, 2025

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 17,802 2,233 Updated Dec 25, 2024

facebookresearch / schedule_free

Schedule-Free Optimization in PyTorch

Python 2,235 68 Updated May 21, 2025

vdmrgv / react-easy-infinite-scroll-hook

♾️ A react hook that makes it easy to add infinite scroll in any components. It is very simple to integrate and supports any direction.

TypeScript 107 7 Updated Nov 8, 2023

stripe-archive / flow-to-typescript-codemod

Codemod Stripe used to migrate 6.5m+ lines of code from Flow to TypeScript

TypeScript 691 74 Updated Apr 11, 2025

Yuan-ManX / ai-audio-datasets

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

864 82 Updated Jul 8, 2025

dragonflydb / dragonfly

A modern replacement for Redis and Memcached

C++ 29,394 1,119 Updated Nov 25, 2025

wq2012 / SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

Python 543 73 Updated Sep 25, 2024

mui / material-ui

Material UI: Comprehensive React component library that implements Google's Material Design. Free forever.

JavaScript 97,345 32,796 Updated Nov 25, 2025

yjs / yjs

Shared data types for building collaborative software

JavaScript 20,628 720 Updated Nov 25, 2025

adelacvg / NS2VC

Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech

Python 236 12 Updated Feb 29, 2024

BitPhinix / slate-yjs

Yjs binding for Slate

TypeScript 547 77 Updated Jun 20, 2024

ttengwang / Caption-Anything

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…

Python 1,770 102 Updated Aug 29, 2023

ggml-org / llama.cpp

LLM inference in C/C++

C++ 90,392 13,823 Updated Nov 25, 2025

microsoft / i-Code

Jupyter Notebook 1,709 166 Updated Sep 27, 2024

xinjli / transphone

phoneme tokenizer and grapheme-to-phoneme model for 8k languages

Python 173 18 Updated Jun 9, 2023

modular / modular

The Modular Platform (includes MAX & Mojo)

Mojo 25,253 2,736 Updated Nov 24, 2025

reworkd / AgentGPT

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

TypeScript 35,270 9,488 Updated Apr 29, 2025

deep-floyd / IF

Python 7,843 526 Updated Apr 14, 2024

declare-lab / tango

A family of diffusion models for text-to-audio generation.

Python 1,214 106 Updated Jul 29, 2025
