Skip to content
View iver56's full-sized avatar
  • ElevenLabs
  • Trondheim, Norway
  • X @iver56

Organizations

@ninjadev

Block or report iver56

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

AI + Data, online. https://vespa.ai

Java 6,495 668 Updated Oct 21, 2025

LongCat Audio Tokenizer and Detokenizer

Python 166 10 Updated Oct 20, 2025
Rust 10 1 Updated Oct 16, 2025

Retrieval-Augmented MOS Prediction with Prior Knowledge Integration

Python 31 3 Updated Mar 23, 2025
Python 40 3 Updated Oct 20, 2025

The best ChatGPT that $100 can buy.

Python 29,605 3,082 Updated Oct 21, 2025

A Python implementation of FastDTW

Python 835 126 Updated May 19, 2023

[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Python 360 14 Updated May 30, 2025

Kroko ASR - Speech-to-text

C++ 83 7 Updated Oct 7, 2025

This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"

Python 209 15 Updated Oct 20, 2025

A novel data compression framework

C 2,425 94 Updated Oct 20, 2025

https://hf.co/hexgrad/Kokoro-82M

JavaScript 4,609 521 Updated Aug 6, 2025

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python 2,479 254 Updated Sep 22, 2025

On-device TTS model by Neuphonic

Python 3,575 326 Updated Oct 17, 2025

IIR Hilbert filter: short, dependency-free, header-only C++

C++ 39 3 Updated Dec 2, 2024

Cloud AI live transcription and translation service plugin

C++ 29 7 Updated Dec 19, 2024

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 5,606 647 Updated Jun 4, 2025

A general purpose task-agnostic speech augmentation policy

Python 8 Updated Oct 2, 2025

[NeurIPS 2024] VFIMamba: Video Frame Interpolation with State Space Models

Python 128 10 Updated Sep 26, 2024

VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency

Python 159 19 Updated Oct 12, 2025

Banquet: A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems

Jupyter Notebook 82 4 Updated Jul 29, 2025

RAM is all you need

Python 191 16 Updated Oct 17, 2025

Audio decoding libraries for C/C++, each in a single source file.

C 1,540 229 Updated Sep 27, 2025

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

C++ 1,772 262 Updated Oct 21, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 7,122 643 Updated Oct 17, 2025

Optimized primitives for collective multi-GPU communication

C++ 4,165 1,040 Updated Oct 18, 2025

common in-memory tensor structure

C++ 1,085 154 Updated Oct 11, 2025

A feature-rich command-line audio/video downloader

Python 131,450 10,552 Updated Oct 18, 2025
Astro 1 2 Updated Oct 15, 2025
Next