Skip to content
View whpy's full-sized avatar

Block or report whpy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

⛄ Possibly the smallest compiler ever

JavaScript 28,468 2,885 Updated Feb 19, 2024

Foam-Agent: An end-to-end, composable multi-agent framework for automating CFD simulations in OpenFOAM. NeurIPS 2025 Machine Learning and the Physical Sciences Workshop.

Python 69 15 Updated Jan 2, 2026
Rust 158 10 Updated Jan 17, 2026

Remake of the original Super Mario Bros game.

C++ 329 84 Updated Feb 4, 2023

Learning in infinite dimension with neural operators.

Python 3,323 810 Updated Jan 12, 2026

本人的科研经验

9,872 526 Updated Jan 10, 2026

A repo for llm on ncnn

C++ 178 21 Updated Jan 2, 2026
Zig 2 Updated May 20, 2025

A linear algebra library for the Zig programming language

Zig 7 Updated Jul 22, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,095 800 Updated Jan 16, 2026
C++ 290 74 Updated Jan 18, 2021

An intuitive and low-overhead instrumentation tool for Python

Python 1,192 39 Updated Jul 8, 2025

Pytorch implementation for MeanFlow

Jupyter Notebook 293 24 Updated Jul 30, 2025

Variational Monte-Carlo updated PEPS

C++ 26 4 Updated Jan 10, 2026

JAX implementation of MeanFlow

Python 522 19 Updated Jul 30, 2025

Ongoing research training transformer models at scale

Python 14,932 3,497 Updated Jan 17, 2026

VideoNSA: Native Sparse Attention Scales Video Understanding

Python 78 2 Updated Nov 16, 2025

A library of GPU kernels for sparse matrix operations.

C++ 281 53 Updated Nov 24, 2020

Speedup the attention computation of Swin Transformer

Python 29 4 Updated Jun 14, 2025

A tiny deep learning training framework implemented from scratch in C++ that follows PyTorch's API.

C++ 144 26 Updated Dec 4, 2025

A really tiny autograd engine

Python 99 3 Updated May 26, 2025

LeetGPU Solutions

Python 95 5 Updated Oct 9, 2025

Pytorch Implementation (unofficial) of the paper "Mean Flows for One-step Generative Modeling" by Geng et al.

Python 1,021 58 Updated Dec 17, 2025

A Python toolkit for fine-tuning Geospatial Foundation Models (GFMs).

Python 702 128 Updated Jan 16, 2026

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,886 117 Updated Nov 4, 2025
Python 11 1 Updated May 16, 2025

Fast and memory-efficient exact attention

Python 21,667 2,289 Updated Jan 17, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 4,735 401 Updated Jan 16, 2026

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,167 881 Updated Dec 17, 2024
Next