gmittal

SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.

C++ 1,797 78 Updated Jan 4, 2026

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 2,136 229 Updated Aug 17, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,255 994 Updated Jul 1, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 899 49 Updated Sep 30, 2025

MLX: An array framework for Apple silicon

C++ 23,403 1,447 Updated Jan 8, 2026

The official Porsche Design System repository, offering fundamental UXI guidelines and a library of reusable web components to enable designers and developers to build consistent, intuitive, and hi…

TypeScript 565 38 Updated Jan 8, 2026
Python 31 7 Updated Jan 9, 2025

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 2,047 231 Updated Oct 16, 2025

Inference Llama 2 in one file of pure 🔥

Mojo 2,116 136 Updated Nov 30, 2025

Python pdb for multiple processes

Python 76 9 Updated May 24, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,865 586 Updated May 3, 2024

Inference code for CodeLlama models

Python 16,364 1,944 Updated Aug 12, 2024

[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674

Python 196 13 Updated Jun 14, 2023

Supercharge Your LLM Application Evaluations 🚀

Python 12,138 1,203 Updated Jan 5, 2026

commaVQ is a dataset of compressed driving video

Jupyter Notebook 340 61 Updated Oct 31, 2025

Tools for building GPU clusters

Shell 1,408 350 Updated Jan 9, 2026

LLMs for your CLI

Python 1,358 78 Updated May 29, 2024

A Data Streaming Library for Efficient Neural Network Training

Python 1,440 181 Updated Oct 27, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 67,166 12,485 Updated Jan 9, 2026

Universal memory layer for AI Agents

Python 45,272 4,937 Updated Jan 3, 2026

Write scalable load tests in plain Python 🚗💨

Python 27,312 3,159 Updated Jan 8, 2026

CUDA on non-NVIDIA GPUs

Rust 13,772 888 Updated Jan 8, 2026

It's React, but in Python

Python 8,151 331 Updated Dec 22, 2025

Implementation of Flash Attention in Jax

Python 223 24 Updated Mar 1, 2024

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Python 981 57 Updated Jan 30, 2024

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Python 841 63 Updated Jul 1, 2024

Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.

Python 714 121 Updated Dec 14, 2025

The Modular Platform (includes MAX & Mojo)

Mojo 25,426 2,759 Updated Jan 9, 2026

The Official Python Client for Lamini's API

Python 2,540 154 Updated Apr 7, 2025

Tiny data-over-sound library

C++ 7,412 434 Updated Aug 26, 2025