Stars
How to optimize algorithms in CUDA.
🔮 ChatGPT Desktop Application (Mac, Windows and Linux)
A list of awesome compiler projects and papers for tensor computation and deep learning.
Elixir: Train a Large Language Model on a Small GPU Cluster
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
🔥Highlighting the top ML papers every week.
Oh my tmux! My self-contained, pretty & versatile tmux configuration made with 💛🩷💙🖤❤️🤍
Development repository for the Triton language and compiler
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
A framework for managing and maintaining multi-language pre-commit hooks.
IdeaVim – A Vim engine for JetBrains IDEs
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
LaTeX template for undergraduate theses at the School of EECS, Peking University
Scalable PaLM implementation in PyTorch
Examples of training models with hybrid parallelism using ColossalAI
Performance benchmarking with ColossalAI
Sky Computing: Accelerating Geo-distributed Computing in Federated Learning
Optimized primitives for collective multi-GPU communication
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Making large AI models cheaper, faster and more accessible
Ongoing research training transformer models at scale