cats256

Follow

cats256

Follow

A hundred years of living.

5 followers · 5 following

Achievements

Achievements

Highlights

Pro

Lists (1)

Sort

🔮 Future ideas

Stars

catswe / LinearKAN

LinearKAN: A very fast implementation of Kolmogorov-Arnold Networks

Python 17 1 Updated Nov 12, 2025

NVIDIA / cuEmbed

CUDA Embedding Lookup Kernel Library

Cuda 40 5 Updated Oct 21, 2025

triton-lang / triton

Development repository for the Triton language and compiler

MLIR 18,079 2,495 Updated Jan 10, 2026

leochlon / pythea

Python 1,269 123 Updated Jan 9, 2026

SimplifyJobs / New-Grad-Positions

A collection of full time roles in SWE, Quant, and PM for new grads.

16,006 1,242 Updated Jan 10, 2026

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 67,192 12,491 Updated Jan 10, 2026

NVIDIA / tilus

Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.

Python 437 16 Updated Dec 16, 2025

AlphaGPU / leetgpu-challenges

LeetGPU Challenges

Python 579 46 Updated Jan 3, 2026

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 6,027 459 Updated Jan 7, 2026

open-lm-engine / lm-engine

LM engine is a library for pretraining/finetuning LLMs

Python 108 24 Updated Jan 8, 2026

openxla / xla

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 3,887 721 Updated Jan 10, 2026

MinghuiChen43 / awesome-deep-phenomena

A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...

383 15 Updated Jan 7, 2026

HazyResearch / Megakernels

kernels, of the mega variety

Python 641 35 Updated Sep 28, 2025

jax-ml / scaling-book

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML 803 116 Updated Jan 10, 2026

aigc-apps / AMFormer

The AMFormer algorithm, accepted at AAAI-2024, for deep tabular learning

Python 41 10 Updated Jul 3, 2024

e3nn / e3nn

A modular framework for neural networks with Euclidean symmetry

Python 1,197 176 Updated Jan 9, 2026

zimonitrome / convolution-shape-calculator

Visualization and calculator for input & output for deep neural networks.

TypeScript 16 3 Updated Jul 28, 2025

jpuigcerver / rnn2d

CPU and GPU implementations of some 2D RNN layers

C++ 29 10 Updated Sep 23, 2017

tinygrad / tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 31,103 3,830 Updated Jan 10, 2026

penge / block-site

Chrome/Firefox extension that blocks access to distracting websites to improve your productivity.

TypeScript 367 50 Updated Nov 17, 2025

vgnshiyer / ASU-sparkysundevil-resume-template

ASU-sparkysundevil-resume-template

TeX 32 21 Updated Oct 3, 2024

ashishps1 / awesome-behavioral-interviews

Tips and resources to prepare for Behavioral interviews.

7,535 1,480 Updated Aug 19, 2025

okdalto / conv_visualizer

conv_visualizer

Processing 490 44 Updated Dec 1, 2024

Azure-Samples / cognitive-services-speech-sdk

Sample code for the Microsoft Cognitive Services Speech SDK

C# 3,381 2,000 Updated Jan 9, 2026

aredden / flux-fp8-api

Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x faster on consumer devices.

Python 285 35 Updated Oct 12, 2024

gbaydin / hypergradient-descent

Hypergradient descent

Python 147 21 Updated May 31, 2024

orangejuicetin / kalshi_market_maker

public facing repo of my algorithm running on platform

Python 4 Updated Nov 20, 2024

nikhilnd / kalshi-market-making

QuantSC Spring '23 Project

Jupyter Notebook 58 7 Updated May 18, 2023

dair-ai / Prompt-Engineering-Guide

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 68,963 7,350 Updated Dec 29, 2025

cats256 / quantile-spline-activation

Adaptive Quantile Activation (AQUA): A learnable activation function that dynamically adapts to input distribution

1 Updated Dec 12, 2024