Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
LinearKAN: A very fast implementation of Kolmogorov-Arnold Networks
Development repository for the Triton language and compiler
A collection of full time roles in SWE, Quant, and PM for new grads.
A high-throughput and memory-efficient inference and serving engine for LLMs
Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.
Efficient Triton Kernels for LLM Training
LM engine is a library for pretraining/finetuning LLMs
A machine learning compiler for GPUs, CPUs, and ML accelerators
A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...
Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs
The AMFormer algorithm, accepted at AAAI-2024, for deep tabular learning
A modular framework for neural networks with Euclidean symmetry
Visualization and calculator for input & output for deep neural networks.
CPU and GPU implementations of some 2D RNN layers
You like pytorch? You like micrograd? You love tinygrad! ❤️
Chrome/Firefox extension that blocks access to distracting websites to improve your productivity.
ASU-sparkysundevil-resume-template
Tips and resources to prepare for Behavioral interviews.
Sample code for the Microsoft Cognitive Services Speech SDK
Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x faster on consumer devices.
public facing repo of my algorithm running on platform
QuantSC Spring '23 Project
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
Adaptive Quantile Activation (AQUA): A learnable activation function that dynamically adapts to input distribution