Iver Jordal iver56

Machine learning, audio/music tech, computer vision, demoscene, web technology, games, startup. I mainly write Python. I also enjoy Go, C, Rust and JS.

367 followers · 170 following

ElevenLabs
Trondheim, Norway
@iver56

Achievements

x3 x3 x4

Starred repositories

vespa-engine / vespa

AI + Data, online. https://vespa.ai

Java 6,495 668 Updated Oct 21, 2025

meituan-longcat / LongCat-Audio-Codec

LongCat Audio Tokenizer and Detokenizer

Python 166 10 Updated Oct 20, 2025

nirgoren / NoisePrints

Rust 10 1 Updated Oct 16, 2025

NKU-HLT / RAMP_MOS

Retrieval-Augmented MOS Prediction with Prior Knowledge Integration

Python 31 3 Updated Mar 23, 2025

SonyCSLParis / audio-metrics

Python 40 3 Updated Oct 20, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 29,605 3,082 Updated Oct 21, 2025

slaypni / fastdtw

A Python implementation of FastDTW

Python 835 126 Updated May 19, 2023

NJU-PCALab / OpenVid-1M

[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Python 360 14 Updated May 30, 2025

kroko-ai / kroko-onnx

Kroko ASR - Speech-to-text

C++ 83 7 Updated Oct 7, 2025

SamsungSAILMontreal / TinyRecursiveModels

Python 5,016 655 Updated Oct 8, 2025

VsonicV / es-fine-tuning-paper

This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"

Python 209 15 Updated Oct 20, 2025

facebook / openzl

A novel data compression framework

C 2,425 94 Updated Oct 20, 2025

hexgrad / kokoro

https://hf.co/hexgrad/Kokoro-82M

JavaScript 4,609 521 Updated Aug 6, 2025

kyutai-labs / delayed-streams-modeling

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python 2,479 254 Updated Sep 22, 2025

neuphonic / neutts-air

On-device TTS model by Neuphonic

Python 3,575 326 Updated Oct 17, 2025

Signalsmith-Audio / hilbert-iir

IIR Hilbert filter: short, dependency-free, header-only C++

C++ 39 3 Updated Dec 2, 2024

royshil / cloudvocal

Cloud AI live transcription and translation service plugin

C++ 29 7 Updated Dec 19, 2024

multimodal-art-projection / YuE

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 5,606 647 Updated Jun 4, 2025

gfdb / wav2aug

A general purpose task-agnostic speech augmentation policy

Python 8 Updated Oct 2, 2025

MCG-NJU / VFIMamba

[NeurIPS 2024] VFIMamba: Video Frame Interpolation with State Space Models

Python 128 10 Updated Sep 26, 2024

herimor / voxtream

VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency

Python 159 19 Updated Oct 12, 2025

kwatcharasupat / query-bandit

Banquet: A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems

Jupyter Notebook 82 4 Updated Jul 29, 2025

lodestone-rock / RamTorch

RAM is all you need

Python 191 16 Updated Oct 17, 2025

mackron / dr_libs

Audio decoding libraries for C/C++, each in a single source file.

C 1,540 229 Updated Sep 27, 2025

kfrlib / kfr

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

C++ 1,772 262 Updated Oct 21, 2025

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 7,122 643 Updated Oct 17, 2025

NVIDIA / nccl

Optimized primitives for collective multi-GPU communication

C++ 4,165 1,040 Updated Oct 18, 2025

dmlc / dlpack

common in-memory tensor structure

C++ 1,085 154 Updated Oct 11, 2025

yt-dlp / yt-dlp

A feature-rich command-line audio/video downloader

Python 131,450 10,552 Updated Oct 18, 2025

hackheim / hackheimweb

Astro 1 2 Updated Oct 15, 2025