Skip to content
View gabteni's full-sized avatar

Block or report gabteni

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,648 259 Updated Nov 6, 2025

A curated collection of public industrial datasets.

HTML 264 34 Updated Aug 28, 2025
Python 20 3 Updated Sep 19, 2025

From babyGPT to diffusion GPT: An annotated implementation of a character-level discrete diffusion model (adapted from Karpathy’s baby GPT).

Jupyter Notebook 231 21 Updated Oct 12, 2025

Python GUI builder. GUI builder for Tkinter, CustomTkinter, Kivy and PySide (upcoming)

JavaScript 1,874 149 Updated Sep 26, 2025

A modular framework for neural networks with Euclidean symmetry

Python 1,171 171 Updated Oct 7, 2025

Optimize prompts, code, and more with AI-powered Reflective Text Evolution

Jupyter Notebook 1,504 108 Updated Nov 8, 2025

Training-Ready RL Environments + Evals

Python 170 186 Updated Nov 10, 2025

Open-source implementation of AlphaEvolve

Python 4,488 665 Updated Nov 1, 2025

JDM Editor is an open-source React component for crafting and designing JDM (JSON Decision model) files.

TypeScript 240 82 Updated Nov 8, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,822 2,670 Updated Jul 3, 2025

Implementation and evaluation of the AXIOM architecture from the preprint "AXIOM: Learning to Play Games in Minutes with Expanding Object Centric Models"

Python 55 14 Updated Jun 2, 2025

Open-source framework for the research and development of foundation models.

HTML 603 56 Updated Nov 10, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 8,380 829 Updated Nov 6, 2025

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 371 43 Updated Oct 29, 2025

🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX

Python 60 5 Updated Oct 23, 2023

Modular, scalable library to train ML models

Python 169 17 Updated Nov 10, 2025

Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors

JavaScript 36,719 1,814 Updated Nov 10, 2025

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, i…

Python 33,563 5,176 Updated Nov 10, 2025

Minimal yet performant LLM examples in pure JAX

Python 195 24 Updated Sep 23, 2025

React UI + elegant infrastructure for AI Copilots, AI chatbots, and in-app AI agents. The Agentic last-mile 🪁

TypeScript 24,835 3,317 Updated Nov 10, 2025

A Model Context Protocol (MCP) server that enables AI assistants to interact with Kubernetes clusters. It serves as a bridge between AI tools (like Claude, Cursor, and GitHub Copilot) and Kubernetes

Go 43 10 Updated Nov 4, 2025

A Model Context Protocol (MCP) server that enables AI assistants to interact with Kubernetes clusters. It serves as a bridge between AI tools (like Claude, Cursor, and GitHub Copilot) and Kubernete…

Python 12 5 Updated Oct 29, 2025

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML 683 99 Updated Oct 22, 2025

RADLADS training code

Python 34 3 Updated May 7, 2025

Open-source Windows and Office activator featuring HWID, Ohook, TSforge, KMS38, and Online KMS activation methods, along with advanced troubleshooting.

Batchfile 155,459 14,996 Updated Nov 9, 2025

Timely detections for more proactive and effective actions in offshore oil wells!

Jupyter Notebook 449 99 Updated Nov 6, 2025

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

C++ 1,205 117 Updated Aug 12, 2024

A Full Live-Scripted CAD Kernel in the Browser

JavaScript 1,256 157 Updated Jan 2, 2025

maximal update parametrization (µP)

Jupyter Notebook 1,621 104 Updated Jul 17, 2024
Next