Rust 153 10 Updated Jan 9, 2026

Remake of the original Super Mario Bros game.

C++ 328 83 Updated Feb 4, 2023

Learning in infinite dimension with neural operators.

Python 3,315 807 Updated Dec 26, 2025

My own research experience

9,723 522 Updated Dec 12, 2025

A repo for llm on ncnn

C++ 173 21 Updated Jan 2, 2026
Zig 2 Updated May 20, 2025

A linear algebra library for the Zig programming language

Zig 6 Updated Jul 22, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,046 793 Updated Jan 6, 2026
C++ 289 74 Updated Jan 18, 2021

An intuitive and low-overhead instrumentation tool for Python

Python 1,192 39 Updated Jul 8, 2025

PyTorch implementation for MeanFlow

Jupyter Notebook 288 25 Updated Jul 30, 2025

Variational Monte-Carlo updated PEPS

C++ 23 4 Updated Jan 10, 2026

JAX implementation of MeanFlow

Python 515 19 Updated Jul 30, 2025

Ongoing research training transformer models at scale

Python 14,853 3,481 Updated Jan 10, 2026

VideoNSA: Native Sparse Attention Scales Video Understanding

Python 77 2 Updated Nov 16, 2025

A library of GPU kernels for sparse matrix operations.

C++ 281 53 Updated Nov 24, 2020

Speed up the attention computation of Swin Transformer

Python 28 4 Updated Jun 14, 2025

A tiny deep learning training framework implemented from scratch in C++ that follows PyTorch's API.

C++ 143 25 Updated Dec 4, 2025

A really tiny autograd engine

Python 98 3 Updated May 26, 2025

LeetGPU Solutions

Python 95 5 Updated Oct 9, 2025

PyTorch implementation (unofficial) of the paper "Mean Flows for One-step Generative Modeling" by Geng et al.

Python 1,012 58 Updated Dec 17, 2025

A Python toolkit for fine-tuning Geospatial Foundation Models (GFMs).

Python 702 127 Updated Jan 9, 2026

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,860 111 Updated Nov 4, 2025
Python 11 1 Updated May 16, 2025

Fast and memory-efficient exact attention

Python 21,524 2,271 Updated Jan 10, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

Python 4,553 384 Updated Jan 9, 2026

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,140 876 Updated Dec 17, 2024

CUDA Kernel Benchmarking Library

Cuda 797 97 Updated Jan 5, 2026

Generation of training-optimised weather datasets from declarative syntax

Python 12 18 Updated Nov 3, 2025