-
HuggingFace
- France
- 3outeille.github.io
- @FerdinandMom
- @FerdinandMom
-
veScale Public
Forked from volcengine/veScaleA PyTorch Native LLM Training Framework
Python Apache License 2.0 UpdatedSep 29, 2025 -
prime-rl Public
Forked from PrimeIntellect-ai/prime-rlDecentralized RL Training at Scale
Python Apache License 2.0 UpdatedSep 19, 2025 -
experiments-with-kernels Public
Forked from Vaibhavs10/experiments-with-kernelsPython UpdatedAug 29, 2025 -
torchtitan Public
Forked from pytorch/torchtitanA native PyTorch Library for large model training
Python BSD 3-Clause "New" or "Revised" License UpdatedAug 28, 2025 -
gpt-oss-recipes Public
Forked from huggingface/gpt-oss-recipesCollection of scripts and notebooks for OpenAI's latest GPT OSS models
Jupyter Notebook Apache License 2.0 UpdatedAug 27, 2025 -
kernels Public
Forked from huggingface/kernelsLoad compute kernels from the Hub
Python Apache License 2.0 UpdatedAug 27, 2025 -
tilelang Public
Forked from tile-ai/tilelangDomain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
C++ Other UpdatedAug 26, 2025 -
kernel-builder Public
Forked from huggingface/kernel-builder👷 Build compute kernels
-
peft Public
Forked from huggingface/peft🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Python Apache License 2.0 UpdatedAug 14, 2025 -
quack Public
Forked from Dao-AILab/quackA Quirky Assortment of CuTe Kernels
Python Apache License 2.0 UpdatedJul 10, 2025 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedJul 4, 2025 -
DeepSpeed Public
Forked from deepspeedai/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python Apache License 2.0 UpdatedJun 30, 2025 -
-
prime Public
Forked from PrimeIntellect-ai/primeprime is a framework for efficient, globally distributed training of AI models over the internet.
-
DualPipe Public
Forked from deepseek-ai/DualPipeA bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
Python MIT License UpdatedFeb 27, 2025 -
nccl Public
Forked from NVIDIA/ncclOptimized primitives for collective multi-GPU communication
C++ Other UpdatedFeb 17, 2025 -
picotron-deepseek Public
Forked from huggingface/picotronMinimalistic 4D-parallelism distributed training framework for education purpose
Python Apache License 2.0 UpdatedJan 24, 2025 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedJan 17, 2025 -
nccl-tests Public
Forked from NVIDIA/nccl-testsNCCL Tests
Cuda BSD 3-Clause "New" or "Revised" License UpdatedDec 12, 2024 -
EasyContext Public
Forked from jzhang38/EasyContextMemory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Python Apache License 2.0 UpdatedSep 27, 2024 -
-
ring-attention-pytorch Public
Forked from lucidrains/ring-attention-pytorchImplementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch
Python MIT License UpdatedSep 25, 2024 -
-
litgpt Public
Forked from Lightning-AI/litgpt20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Python Apache License 2.0 UpdatedAug 12, 2024 -
dust Public
Forked from kelpsyberry/dustA Nintendo DS emulator written in Rust for desktop devices and the web, with debugging features and a focus on accuracy
Rust GNU General Public License v3.0 UpdatedAug 6, 2024 -
fms-fsdp Public
Forked from foundation-model-stack/fms-fsdpDemonstrate throughput of PyTorch FSDP
Python Apache License 2.0 UpdatedJul 5, 2024 -
minRF Public
Forked from cloneofsimo/minRFMinimal implementation of scalable rectified flow transformers, based on SD3's approach
-
ColossalAI Public
Forked from hpcaitech/ColossalAIMaking large AI models cheaper, faster and more accessible
Python Apache License 2.0 UpdatedJun 14, 2024 -
diloco_simple Public
Forked from PrimeIntellect-ai/diloco_simpletorch implementation of diloco
Python UpdatedMay 31, 2024 -
lighteval Public
Forked from huggingface/lightevalLightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
Python MIT License UpdatedMay 23, 2024