i-Pear
:electron:
practicing magic
  • Nanjing University
  • 14:24 (UTC +08:00)

Highlights

  • Pro

Organizations

@NEUP-Net-Depart @gsoc-cn @unikraft @HMUniversity @ipearworks


Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

C++ 3,930 315 Updated Nov 15, 2025

Programmable CUDA/C++ GPU Graph Analytics

C++ 1,044 215 Updated Jul 30, 2024

MLIR-based partitioning system

MLIR 148 25 Updated Nov 15, 2025

PROPELLER: Profile Guided Optimizing Large Scale LLVM-based Relinker

C++ 463 45 Updated Nov 13, 2025

Training and serving large-scale neural networks with auto parallelization.

Python 3,165 354 Updated Dec 9, 2023

PyTorch extensions for high performance and large scale training.

Python 3,385 294 Updated Apr 26, 2025

Making large AI models cheaper, faster and more accessible

Python 41,238 4,540 Updated Nov 13, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,190 365 Updated Aug 14, 2025

LingoDB: A new analytical database system that blurs the lines between databases and compilers.

C++ 284 52 Updated Nov 14, 2025

A cross platform way to express data transformation, relational algebra, standardized record expression and plans.

Python 1,427 187 Updated Nov 16, 2025

Showcase of homework by the "losers" (卢瑟), with answer explanations and some C++ knowledge

C++ 744 140 Updated Oct 6, 2025

A list of bugs found by SQLancer

Python 17 7 Updated Jan 30, 2024

Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.

Cuda 391 52 Updated Jan 2, 2025

Summary of internet-industry campus recruitment information for the class of 2025

856 51 Updated Feb 24, 2025

Material for gpu-mode lectures

Jupyter Notebook 5,296 533 Updated Sep 23, 2025

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,456 792 Updated Nov 16, 2025

A lightweight memory allocator for hardware-accelerated machine learning

C++ 173 15 Updated Sep 30, 2025

Keep your bugs contained. A platform for studying historical software bugs.

Python 69 12 Updated Jan 8, 2025

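"Debian Little Pillbox" (Debian 小药盒): a box design for packaging Debian installation media, with an accompanying instruction leaflet that introduces it.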

TeX 1,487 72 Updated Aug 10, 2025

A massively parallel, optimal functional runtime in Rust

Cuda 11,154 426 Updated Nov 21, 2024

💥💻💥 A data-parallel functional programming language

Haskell 2,616 189 Updated Nov 14, 2025

A massively parallel, high-level programming language

Rust 19,083 465 Updated Jun 3, 2025

Training materials provided by OpenACC.org.

C 95 28 Updated Aug 6, 2024

C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!

C++ 584 151 Updated Jun 19, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 1,945 153 Updated Nov 15, 2025

NO TIME TO SLEEP

Python 647 25 Updated May 26, 2024

Tile primitives for speedy kernels

Cuda 2,908 195 Updated Nov 15, 2025

Hands-On Practical MLIR Tutorial

C++ 654 96 Updated Oct 20, 2023

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 10,036 1,672 Updated Nov 15, 2025

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,673 320 Updated Oct 19, 2024