Skip to content
View SonicCodes's full-sized avatar
:shipit:
bilding kkkkk
:shipit:
bilding kkkkk

Block or report SonicCodes

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Unofficial implementation of the toy example in JiT https://arxiv.org/abs/2511.13720

Jupyter Notebook 5 Updated Nov 24, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 2,007 127 Updated Apr 3, 2025

Code for "What really matters in matrix-whitening optimizers?"

Python 17 1 Updated Oct 31, 2025

Let's train vision transformers (ViT) for cifar 10 / cifar 100!

Python 697 133 Updated Nov 20, 2025

An efficient implementation of the NSA (Native Sparse Attention) kernel

Python 126 4 Updated Jun 24, 2025

Pytorch implementation of MeanFlow on ImageNet and CIFAR10

Python 351 22 Updated Aug 23, 2025

[CVPR 2025] "DiC: Rethinking Conv3x3 Designs in Diffusion Models", a performant & speedy Conv3x3 diffusion model.

Python 215 16 Updated Jun 12, 2025

[NeurIPS 2025 Spotlight] DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling

Python 55 2 Updated Oct 31, 2025

Fast, Numerically Stable, and Auto-Differentiable Spectral Clipping via Newton-Schulz Iteration

Jupyter Notebook 12 Updated Jun 21, 2025

Fast low-bit matmul kernels in Triton

Python 401 29 Updated Nov 21, 2025

The CLI for GPUs

Python 117 4 Updated Nov 23, 2025

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 2 Updated Jul 2, 2025

Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".

Python 139 10 Updated Nov 20, 2025

F Lite is a 10B parameter diffusion model created by Freepik and Fal, trained exclusively on copyright-safe and SFW content.

Python 416 36 Updated Aug 25, 2025
Python 2 Updated Apr 20, 2025

Implementation of Jetformer.

Python 8 Updated May 10, 2025

Schedule-Free Optimization in PyTorch

Python 2,236 68 Updated May 21, 2025

Official implementation of Inductive Moment Matching

Python 564 13 Updated Jul 11, 2025

An End-To-End, Lightweight and Flexible Platform for Game Research

C++ 2,096 284 Updated Aug 30, 2021

⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)

Python 639 41 Updated Mar 11, 2025

research impl of Native Sparse Attention (2502.11089)

Python 63 3 Updated Feb 19, 2025

Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)

Python 34 5 Updated Feb 27, 2025

Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)

Python 1 Updated Jan 31, 2025
Python 157 10 Updated Oct 15, 2025

maximal update parametrization (µP)

Jupyter Notebook 1,634 104 Updated Jul 17, 2024

Training Large Language Model to Reason in a Continuous Latent Space

Python 1,356 143 Updated Aug 12, 2025

Library for reading and processing ML training data.

Python 607 56 Updated Nov 27, 2025
Next