Skip to content
View kkuoo7's full-sized avatar

Block or report kkuoo7

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A library of long-horizon Task-and-Motion-Planning (TAMP) problems in kitchen and household scenes, as well as planners to solve them

Jupyter Notebook 153 26 Updated May 15, 2025

StelLA: Subspace Learning in Low-rank Adaptation using Stiefel Manifold (NeurIPS 2025 Spotlight)

Python 13 Updated Oct 20, 2025

[NeurIPS 2025] Official Implementation of ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding.

Python 31 Updated Oct 31, 2025

Motion correction application using on-device AI.

Dart 2 1 Updated Jul 24, 2023

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 474 108 Updated Nov 15, 2025

Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, MySQL, Chromium, Redis and WebKit/Safari

C++ 1,917 167 Updated Nov 3, 2025

Official code of "StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs".

Python 74 5 Updated Jun 23, 2025

Modern Fixed Point Systems using Pytorch

Python 122 13 Updated Oct 31, 2023

The Art of Debugging

Python 1,140 55 Updated Nov 17, 2025

A Collection of Papers on Diffusion Language Models

145 6 Updated Sep 15, 2025

Continuous Thought Machines, because thought takes time and reasoning is a process.

Python 1,397 205 Updated Oct 14, 2025

Physics of Language Models, Part 4

HTML 260 13 Updated Jul 29, 2025

Docs of the Hugging Face Hub

Handlebars 471 378 Updated Nov 17, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 3,941 318 Updated Nov 17, 2025

Code for ICLR 2025 paper "Emergence of a High-Dimensional Abstraction Phase in Language Transformers"

Python 3 1 Updated Jan 23, 2025

Training Large Language Model to Reason in a Continuous Latent Space

Python 1,338 141 Updated Aug 12, 2025

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 9,274 389 Updated Aug 12, 2025

[SIGMOD' 25] A fast parallel kd-tree implementation

C++ 85 5 Updated Nov 16, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,412 485 Updated Nov 17, 2025

My learning notes/codes for ML SYS.

Python 4,186 254 Updated Nov 17, 2025

Data Structure Algorithms, (GenAI/ML) System Design, Machine Learning, DevOps coding interview practices

516 135 Updated Oct 7, 2025

The nnsight package enables interpreting and manipulating the internals of deep learned models.

Jupyter Notebook 697 63 Updated Nov 14, 2025
Jupyter Notebook 182 9 Updated May 16, 2025

Backend.AI is a streamlined, container-based computing cluster platform that hosts popular computing/ML frameworks and diverse programming languages, with pluggable heterogeneous accelerator suppor…

Python 588 163 Updated Nov 17, 2025

Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"

Python 118 9 Updated Oct 6, 2025

The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Jupyter Notebook 106 6 Updated Sep 19, 2025

A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.

Python 431 26 Updated Mar 10, 2025
Next