Skip to content
View thuwzt's full-sized avatar

Organizations

@thu-ml

Block or report thuwzt

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention

Python 221 12 Updated Dec 28, 2025

Official repo for vidar and vidarc: video foundation model for robotics.

Python 28 Updated Dec 22, 2025

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 2,967 193 Updated Jan 1, 2026

A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention

268 5 Updated Dec 1, 2025

The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Jupyter Notebook 750 48 Updated Dec 20, 2025

[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.

Cuda 880 77 Updated Dec 31, 2025

Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training

Python 34 4 Updated Jun 20, 2025

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

16,581 1,529 Updated Feb 13, 2023

Official implementation for "Pruning Large Language Models with Semi-Structural Adaptive Sparse Training" (AAAI 2025)

Python 17 2 Updated Jul 1, 2025

[ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.

Python 104 7 Updated Dec 20, 2024

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,633 381 Updated Jun 2, 2025

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,985 301 Updated Dec 22, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,397 3,254 Updated Jan 2, 2026

Ongoing research training transformer models at scale

Python 14,772 3,441 Updated Jan 2, 2026

Triton-based implementation of Sparse Mixture of Experts.

Python 259 26 Updated Oct 3, 2025

Development repository for the Triton language and compiler

MLIR 18,010 2,481 Updated Jan 2, 2026

[TMLR 2024] Efficient Large Language Models: A Survey

1,245 98 Updated Jun 23, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,043 596 Updated Jan 2, 2026

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,130 4,676 Updated Jan 1, 2026

Official code for "Efficient Backpropagation with Variance Controlled Adaptive Sampling" (ICLR 2024)

Python 8 2 Updated Mar 8, 2024

Fast and memory-efficient exact attention

Python 21,398 2,258 Updated Jan 1, 2026

Low-bit optimizers for PyTorch

Python 137 9 Updated Oct 9, 2023

A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。

JavaScript 43,785 5,370 Updated Dec 1, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,618 8,650 Updated Nov 12, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 96,288 26,408 Updated Jan 2, 2026

LaTeX Thesis Template for Tsinghua University

TeX 5,096 1,133 Updated Dec 29, 2025

The JavaScript library that provides a program-friendly interface to Tsinghua web portal

TypeScript 28 5 Updated Sep 24, 2023

清华大学计算机系课程攻略 Guidance for courses in Department of Computer Science and Technology, Tsinghua University

HTML 36,343 7,834 Updated Nov 28, 2025