Skip to content
View tridao's full-sized avatar

Highlights

  • Pro

Organizations

@Dao-AILab

Block or report tridao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Open ABI and FFI for Machine Learning Systems

C++ 348 61 Updated Feb 19, 2026

Unofficial description of the CUDA assembly (SASS) instruction sets.

Python 201 19 Updated Jul 18, 2025

H-Net: Hierarchical Network with Dynamic Chunking

Python 813 95 Updated Nov 20, 2025

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,296 1,687 Updated Feb 18, 2026

Kimi K2 is the large language model series developed by Moonshot AI team

10,392 781 Updated Jan 21, 2026

A high-performance library for compressed ndarrays, with a flexible computational engine

Python 193 36 Updated Feb 19, 2026

Development repository for the Triton language and compiler

MLIR 18,452 2,593 Updated Feb 20, 2026

FlashAttention (Metal Port)

Swift 580 37 Updated Sep 22, 2024

Fast and memory-efficient exact attention

Python 22,302 2,393 Updated Feb 20, 2026

A PyTorch-based Speech Toolkit

Python 11,222 1,651 Updated Feb 11, 2026

OSLO: Open Source framework for Large-scale model Optimization

Python 309 30 Updated Aug 25, 2022

Machine learning metrics for distributed, scalable PyTorch applications.

Python 2,406 475 Updated Feb 18, 2026

Flexible Python configuration system. The last one you will ever need.

Python 2,346 147 Updated Nov 29, 2025

PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡

Python 5,155 755 Updated Aug 16, 2024

Constrained optimization toolkit for PyTorch

Python 707 35 Updated Jul 29, 2025

KErnel OPerationS, on CPUs and GPUs, with autodiff and without memory overflows

Python 1,158 76 Updated Feb 6, 2026

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 34,909 3,428 Updated Feb 20, 2026

Emacs client/library for the Language Server Protocol

Emacs Lisp 5,060 962 Updated Feb 14, 2026

lsp-mode ❤️ Microsoft's python language server

Emacs Lisp 188 41 Updated Jul 31, 2023

Hydra is a framework for elegantly configuring complex applications

Python 10,203 806 Updated Feb 7, 2026

Research workflows made easy, locally and in the Cloud.

Python 500 67 Updated Jun 6, 2024

PyTorch Extension Library of Optimized Scatter Operations

Python 1,726 205 Updated Jan 21, 2026

Kernel Tuner

Python 385 63 Updated Feb 17, 2026

Debug PyTorch code using PySnooper

Python 800 43 Updated Apr 28, 2021

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 30,852 3,670 Updated Feb 16, 2026

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding a…

Python 2,395 446 Updated Mar 14, 2022

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Python 4,357 390 Updated Oct 22, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 41,403 7,240 Updated Feb 20, 2026

Jupyter notebook client in Emacs

Emacs Lisp 1,513 128 Updated Dec 12, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 97,605 26,914 Updated Feb 20, 2026
Next