Skip to content
View gohar94's full-sized avatar

Highlights

  • Pro

Block or report gohar94

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 31,120 3,831 Updated Jan 13, 2026

NVIDIA Linux open GPU kernel module source

C 16,612 1,562 Updated Dec 18, 2025

Official JAX implementation of End-to-End Test-Time Training for Long Context

Python 281 14 Updated Dec 29, 2025
298 26 Updated Sep 23, 2025

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 1,319 180 Updated Dec 17, 2025
Jupyter Notebook 20 2 Updated May 18, 2025

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 1,302 85 Updated Jul 14, 2024

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 2,903 318 Updated Jan 6, 2026

Nano vLLM

Python 10,717 1,373 Updated Nov 3, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python 50,622 4,175 Updated Jan 12, 2026

Resource Multiplexing in Tuning and Serving Large Language Models (USENIX ATC 2025)

Python 7 5 Updated May 16, 2025

Naive attempt at implementing TTT paper by letting autograd do the heavy lifting

Python 8 Updated Apr 20, 2025

Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch

Python 1,873 189 Updated Jan 7, 2026

dspy-cli is a tool for creating, developing, testing, and deploying DSPy programs as HTTP APIs.

Python 106 6 Updated Jan 9, 2026

Kernel Tuner

Python 379 60 Updated Jan 12, 2026

NVIDIA Linux open GPU with P2P support

C 11 1 Updated Jan 6, 2026

Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". THIS IS A FORK OF BAKITAS REPO. I AM NOT ONE OF THE AUTHORS OF THE PAPER.

C 48 3 Updated Nov 24, 2025

Dynamic Memory Management for Serving LLMs without PagedAttention

C 454 35 Updated May 30, 2025

The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models" and "M+: Extending MemoryLLM with Scalable Long-Term Memory"

Python 284 25 Updated Jul 28, 2025

The open-source RAG platform: built-in citations, deep research, 22+ file formats, partitions, MCP server, and more.

TypeScript 1,785 152 Updated Jan 8, 2026

The best ChatGPT that $100 can buy.

Python 40,205 5,177 Updated Jan 12, 2026

Contexts Optical Compression

Python 21,996 2,004 Updated Oct 25, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,257 756 Updated Jan 10, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,768 776 Updated Jan 13, 2026

Train transformer language models with reinforcement learning.

Python 16,936 2,417 Updated Jan 12, 2026

A framework for optimizing DSPy programs with RL

Python 304 27 Updated Jan 12, 2026

Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"

Python 342 32 Updated Nov 10, 2025
Next