Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO

C++ 1,875 79 Updated Sep 10, 2025

ClickHouse / clickhouse-presentations

Presentations, meetups and talks about ClickHouse

HTML 1,060 190 Updated Nov 25, 2025

Tessil / hopscotch-map

C++ implementation of a fast hash map and hash set using hopscotch hashing

C++ 763 68 Updated Nov 2, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 37,776 4,645 Updated Nov 17, 2025

scandum / quadsort

Quadsort is a branchless stable adaptive mergesort faster than quicksort.

C 2,175 109 Updated Jul 27, 2024

scandum / crumsort

A branchless unstable quicksort / mergesort that is highly adaptive.

C 334 10 Updated Jul 27, 2024

ClickHouse / ClickHouse

ClickHouse® is a real-time analytics database management system

C++ 44,344 7,865 Updated Nov 29, 2025

Neargye / magic_enum

Static reflection for enums (to string, from string, iteration) for modern C++, work with any enum type without any macro or boilerplate code

C++ 5,822 521 Updated Nov 21, 2025

orlp / pdqsort

Pattern-defeating quicksort.

C++ 2,452 98 Updated Dec 6, 2023

brucefan1983 / CUDA-Programming

Sample codes for my CUDA programming book

Cuda 1,937 377 Updated Feb 15, 2025

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook 5,358 540 Updated Nov 21, 2025

CodedK / CUDA-by-Example-source-code-for-the-book-s-examples-

CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. The authors introduce each area of CUDA development through w…

C 459 147 Updated Jun 30, 2023

alibaba / TorchEasyRec

An easy-to-use framework for large scale recommendation algorithms.

Python 270 52 Updated Nov 27, 2025

Fish🐟 imdouyu

Lists (7)

C++

Deep Learning

Interview

Learn

LLM

Misc

Tools

Stars