Learn CUDA with PyTorch

Cuda 156 23 Updated Dec 21, 2025

TTS model capable of streaming conversational audio in realtime.

Python 996 81 Updated Nov 29, 2025

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

Python 3,025 266 Updated Jul 7, 2025

"AI-Trader: Can AI Beat the Market?" Live Trading Bench: https://ai4trade.ai Tech Report Link: https://arxiv.org/abs/2512.10971

Python 10,515 1,712 Updated Dec 19, 2025

FlashInfer: Kernel Library for LLM Serving

Python 4,409 621 Updated Jan 2, 2026

LLM101n: Let's build a Storyteller

36,053 1,962 Updated Aug 1, 2024

The best ChatGPT that $100 can buy.

Python 39,617 5,057 Updated Jan 1, 2026

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,192 196 Updated Oct 9, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 7,779 711 Updated Dec 30, 2025

An open-source, real-time streaming Automatic Speech Recognition (ASR) model for Thai, optimized for low-latency CPU deployment.

Python 32 6 Updated Nov 28, 2025

Code and training scripts for FlexOlmo

Python 120 16 Updated Dec 18, 2025

Code release for NeX: Real-time View Synthesis with Neural Basis Expansion

Python 606 73 Updated Mar 3, 2022

Text-audio foundation model from Boson AI

Python 116 21 Updated Sep 4, 2025

Text-audio foundation model from Boson AI

Python 7,801 584 Updated Sep 15, 2025

Automatically cleans, enhances, segments, filters, and formats a dataset to fine-tune or train a voice model.

Python 46 3 Updated Sep 15, 2025

A FastAPI wrapper for NVIDIA's new Parakeet 0.6b v2, a 600-million-parameter ASR model designed for high-quality English speech recognition

Python 136 40 Updated Oct 30, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 89,434 10,307 Updated Jan 2, 2026

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python 2,687 272 Updated Nov 26, 2025

Will write CUDA for 100 days

Cuda 35 4 Updated May 25, 2025

My submission for the GPUMODE/AMD fp8 mm challenge

Python 29 Updated Jun 4, 2025

Python 83 14 Updated Nov 28, 2025

The official Python SDK for Model Context Protocol servers and clients

Python 20,904 2,952 Updated Jan 1, 2026

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,950 288 Updated May 15, 2025

LLMPerf is a library for validating and benchmarking LLMs

Python 1,070 197 Updated Dec 9, 2024

Fastest kernels written from scratch

Cuda 507 61 Updated Sep 18, 2025

Fast CUDA matrix multiplication from scratch

Cuda 998 149 Updated Sep 2, 2025

CUDA/Metal accelerated language model inference

C 625 30 Updated May 29, 2025

Inference Llama 2 in one file of pure C

C 19,071 2,434 Updated Aug 6, 2024

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

C++ 544 50 Updated Sep 13, 2025