-
flash-attention Public
Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License Updated Nov 27, 2025
-
driss_torch Public
CUDA extensions for PyTorch
-
cutlass Public
Forked from NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
C++ Other Updated Nov 22, 2025
-
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated Nov 20, 2025
-
transformer_nuggets Public
A place to store reusable transformer components of my own creation or found on the interwebs
-
pytorch Public
Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
-
simple_cuda Public
Learnings + Exercises from the PMPP book!
-
FBGEMM Public
Forked from pytorch/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
C++ Other Updated Aug 29, 2025
-
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
Python Apache License 2.0 Updated Jun 3, 2025
-
lean_ua Public
Lean 4 formalizations of proofs from Stephen Abbott's Understanding Analysis textbook
-
tritonbench Public
Forked from meta-pytorch/tritonbench
Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.
Python BSD 3-Clause "New" or "Revised" License Updated Apr 12, 2025
-
ao Public
Forked from pytorch/ao
The torchao repository contains APIs and workflows for quantization and pruning GPU models.
Python BSD 3-Clause "New" or "Revised" License Updated Feb 15, 2025
-
sglang Public
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 Updated Dec 23, 2024
-
nanoGPT Public
Forked from karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Python MIT License Updated Nov 25, 2024
-
triton Public
Forked from triton-lang/triton
Development repository for the Triton language and compiler
C++ MIT License Updated Oct 25, 2024
-
torchtitan Public
Forked from pytorch/torchtitan
A native PyTorch Library for large model training
Python BSD 3-Clause "New" or "Revised" License Updated Oct 4, 2024
-
foundation-model-stack Public
Forked from AdnanHoque/foundation-model-stack
🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.
Python Apache License 2.0 Updated Sep 9, 2024
-
executorch Public
Forked from pytorch/executorch
On-device AI across mobile, embedded and edge for PyTorch
C++ Other Updated Aug 19, 2024
-
pytorch.github.io Public
Forked from pytorch/pytorch.github.io
The website for PyTorch
HTML BSD 3-Clause "New" or "Revised" License Updated Aug 9, 2024
-
tensordict Public
Forked from pytorch/tensordict
TensorDict is a pytorch dedicated tensor container.
Python MIT License Updated Aug 5, 2024
-
attention-gym Public
Helpful tools and examples for working with flex-attention
-
tlparse Public
Forked from meta-pytorch/tlparse
TORCH_LOGS parser for PT2
Rust BSD 3-Clause "New" or "Revised" License Updated Jul 15, 2024
-
builder Public
Forked from pytorch/builder
Continuous builder and binary build scripts for pytorch
Shell BSD 2-Clause "Simplified" License Updated Jun 20, 2024