Fast, small, and fully autonomous AI assistant infrastructure — deploy anywhere, swap anything 🦀

Rust 13,535 1,392 Updated Feb 19, 2026

Minimal Claude Code alternative. Single Python file, zero dependencies, ~250 lines.

Python 1,996 176 Updated Jan 14, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 23,594 4,481 Updated Feb 19, 2026

Go ahead and axolotl questions

Python 11,303 1,252 Updated Feb 18, 2026

Render After Effects animations natively on Web, Android and iOS, and React Native. http://airbnb.io/lottie/

JavaScript 31,693 2,929 Updated Sep 1, 2025

[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

Cuda 372 40 Updated Jul 10, 2025

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…

Python 2,681 360 Updated Feb 19, 2026

A framework for few-shot evaluation of language models.

Python 11,454 3,049 Updated Feb 15, 2026

LLM training in simple, raw C/CUDA

Cuda 28,927 3,390 Updated Jun 26, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 53,424 9,040 Updated Nov 12, 2025

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 828 43 Updated Jul 29, 2025

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 95,268 8,697 Updated Feb 18, 2026

Example code for Fluent Python, 2nd edition (O'Reilly 2022)

Python 3,950 1,147 Updated Oct 28, 2025

Understanding Deep Learning - Simon J.D. Prince

Jupyter Notebook 9,091 2,128 Updated Feb 9, 2026

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

Python 504 73 Updated Aug 1, 2024

[CoLM'25] The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>

Python 154 8 Updated Jan 14, 2026

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,185 394 Updated Jul 11, 2024

Fast and memory-efficient exact attention

Python 22,290 2,390 Updated Feb 18, 2026

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 31,397 3,906 Updated Feb 19, 2026

Add-on agent to generate and expose cluster-level metrics.

Go 6,064 2,151 Updated Feb 18, 2026

Repo to submit jobs to the AMD cluster

Python 11 Updated Oct 30, 2024

Solve puzzles. Improve your pytorch.

Jupyter Notebook 3,922 351 Updated Jul 15, 2024

Solve puzzles. Learn CUDA.

Jupyter Notebook 11,956 925 Updated Sep 1, 2024

The Python programming language

Python 71,595 34,099 Updated Feb 19, 2026

Efficient Triton Kernels for LLM Training

Python 6,147 489 Updated Feb 13, 2026

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

Go 6,740 785 Updated Feb 18, 2026

A comprehensive catalog of modern and classic books on C++ programming language

1,555 256 Updated May 26, 2024

My own templates and implementation of important algorithms and data structures for competitive programming

C++ 426 131 Updated Apr 22, 2025

Empowering everyone to build reliable and efficient software.

Rust 110,464 14,509 Updated Feb 19, 2026