Stars
Flash Attention in ~100 lines of CUDA (forward pass only)
Triton implementation of Flash Attention 2.0
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Implementation of papers in 100 lines of code.
An easy-to-use, scalable and high-performance RLHF framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Dynamic Sampling & Async Agentic RL)
Minimal reproduction of DeepSeek R1-Zero
Instead of running one environment at a time or one per thread, run everything in batch using numpy on a single core (see the sketch after this list).
Fully open reproduction of DeepSeek-R1
🚀 Efficient implementations of state-of-the-art linear attention models
A PyTorch native platform for training generative AI models
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…
FlagGems is an operator library for large language models implemented in the Triton Language.
PyTorch native quantization and sparsity for training and inference
Development repository for the Triton language and compiler
Optimizing SGEMM kernel functions on NVIDIA GPUs to close-to-cuBLAS performance.
A very simple shared memory dict implementation
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
Seamless operability between C++11 and Python
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Denoising Diffusion Probabilistic Models
A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.
An open source implementation of CLIP.
PyTorch Implementation of OpenAI's Image GPT
Large Language Model-enhanced Recommender System Papers
An unnecessarily tiny implementation of GPT-2 in NumPy.
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
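The batched-environments entry above (run everything in batch using numpy on a single core) can be illustrated with a minimal sketch. The toy random-walk environment and all names below are my own assumptions for illustration, not the starred repo's API.

```python
# Minimal sketch of batched environments: the state of N environments lives in
# numpy arrays and every environment is stepped at once with vectorized ops,
# instead of looping over environments or spawning one thread per environment.
import numpy as np

class BatchedRandomWalkEnv:
    """Toy 1-D random walk; an episode ends when |position| reaches `target`."""

    def __init__(self, num_envs: int, target: int = 10, seed: int = 0):
        self.num_envs = num_envs
        self.target = target
        self.rng = np.random.default_rng(seed)
        self.pos = np.zeros(num_envs, dtype=np.int64)

    def reset(self) -> np.ndarray:
        self.pos[:] = 0
        return self.pos.copy()

    def step(self, actions: np.ndarray):
        # actions: (num_envs,) array of 0 (move left) or 1 (move right)
        self.pos += np.where(actions == 1, 1, -1)
        done = np.abs(self.pos) >= self.target
        reward = done.astype(np.float32)   # +1 when a walk reaches the target
        self.pos[done] = 0                 # auto-reset finished environments
        return self.pos.copy(), reward, done

# Step 4096 environments at once on a single core with a random policy.
env = BatchedRandomWalkEnv(num_envs=4096)
obs = env.reset()
for _ in range(100):
    actions = env.rng.integers(0, 2, size=env.num_envs)
    obs, reward, done = env.step(actions)
```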