Skip to content
View tridao's full-sized avatar

Highlights

  • Pro

Organizations

@Dao-AILab

Block or report tridao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Unofficial description of the CUDA assembly (SASS) instruction sets.

Python 190 19 Updated Jul 18, 2025

H-Net: Hierarchical Network with Dynamic Chunking

Python 798 90 Updated Nov 20, 2025

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,039 1,603 Updated Dec 24, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

9,787 715 Updated Nov 7, 2025

A high-performance library for compressed ndarrays, with a flexible computational engine

Python 183 30 Updated Jan 2, 2026

Development repository for the Triton language and compiler

MLIR 18,010 2,481 Updated Jan 2, 2026

FlashAttention (Metal Port)

Swift 571 36 Updated Sep 22, 2024

Fast and memory-efficient exact attention

Python 21,396 2,258 Updated Jan 1, 2026

A PyTorch-based Speech Toolkit

Python 10,996 1,619 Updated Jan 1, 2026

OSLO: Open Source framework for Large-scale model Optimization

Python 309 29 Updated Aug 25, 2022

Machine learning metrics for distributed, scalable PyTorch applications.

Python 2,388 470 Updated Dec 23, 2025

Flexible Python configuration system. The last one you will ever need.

Python 2,318 142 Updated Nov 29, 2025

PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡

Python 5,083 745 Updated Aug 16, 2024

Constrained optimization toolkit for PyTorch

Python 707 35 Updated Jul 29, 2025

KErnel OPerationS, on CPUs and GPUs, with autodiff and without memory overflows

Python 1,152 77 Updated Oct 31, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 34,476 3,337 Updated Jan 2, 2026

Emacs client/library for the Language Server Protocol

Emacs Lisp 5,046 951 Updated Dec 30, 2025

lsp-mode ❤️ Microsoft's python language server

Emacs Lisp 188 41 Updated Jul 31, 2023

Hydra is a framework for elegantly configuring complex applications

Python 10,070 775 Updated Dec 11, 2025

Research workflows made easy, locally and in the Cloud.

Python 500 66 Updated Jun 6, 2024

PyTorch Extension Library of Optimized Scatter Operations

Python 1,718 205 Updated Aug 12, 2025

Kernel Tuner

Python 378 60 Updated Dec 19, 2025

Debug PyTorch code using PySnooper

Python 801 43 Updated Apr 28, 2021

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 30,661 3,635 Updated Dec 29, 2025

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding a…

Python 2,392 446 Updated Mar 14, 2022

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Python 4,355 389 Updated Oct 22, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,581 7,061 Updated Jan 2, 2026

Jupyter notebook client in Emacs

Emacs Lisp 1,511 127 Updated Dec 12, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 96,288 26,408 Updated Jan 2, 2026

Automatic tiling window manager for macOS à la xmonad.

Swift 15,923 507 Updated Jan 2, 2026
Next