Stars
You like pytorch? You like micrograd? You love tinygrad! ❤️
NVIDIA Linux open GPU kernel module source
Official JAX implementation of End-to-End Test-Time Training for Long Context
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.
Resource Multiplexing in Tuning and Serving Large Language Models (USENIX ATC 2025)
Naive attempt at implementing the TTT paper by letting autograd do the heavy lifting
Unofficial implementation of Titans, SOTA memory for transformers, in PyTorch
dspy-cli is a tool for creating, developing, testing, and deploying DSPy programs as HTTP APIs.
NVIDIA Linux open GPU kernel modules with P2P support
Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". THIS IS A FORK OF BAKITA'S REPO. I AM NOT ONE OF THE AUTHORS OF THE PAPER.
Dynamic Memory Management for Serving LLMs without PagedAttention
The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models" and "M+: Extending MemoryLLM with Scalable Long-Term Memory"
The open-source RAG platform: built-in citations, deep research, 22+ file formats, partitions, MCP server, and more.
Hackable and optimized Transformers building blocks, supporting a composable construction.
A Datacenter-Scale Distributed Inference Serving Framework
Train transformer language models with reinforcement learning.
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"