Skip to content
View gengala's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report gengala

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Visualizer for neural network, deep learning and machine learning models

JavaScript 31,876 3,033 Updated Nov 26, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,642 3,154 Updated Nov 26, 2025

The best ChatGPT that $100 can buy.

Python 37,614 4,608 Updated Nov 17, 2025

Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"

Python 249 25 Updated Jan 31, 2025

A bunch of triton kernels with increasing complexity for learning and exploring triton and GPU programming

Python 1 Updated Aug 1, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,307 1,945 Updated Nov 1, 2025

OLMo-core ported for Snellius

Python 2 Updated Jul 15, 2025

Code accompanying the paper "Generalized Interpolating Discrete Diffusion"

Python 108 16 Updated Jun 9, 2025

Minimalistic large language model 3D-parallelism training

Python 2,337 257 Updated Nov 21, 2025

TransMLA: Multi-Head Latent Attention Is All You Need (NeurIPS 2025 Spotlight)

Python 413 24 Updated Sep 23, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 79,840 11,883 Updated Nov 25, 2025

Efficient Triton Kernels for LLM Training

Python 5,876 438 Updated Nov 23, 2025

PyTorch building blocks for the OLMo ecosystem

Python 435 67 Updated Nov 26, 2025

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 13,888 2,060 Updated Aug 8, 2024

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 156,625 13,753 Updated Nov 26, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,779 451 Updated Nov 26, 2025

An extension of the nanoGPT repository for training small MOE models.

Python 215 26 Updated Mar 9, 2025

Stanford Drone Dataset with non-convex Constraints

Jupyter Notebook 7 Updated Nov 25, 2025
Python 7 Updated Feb 3, 2025

Sum-of-squares Non-monotonic Probabilistic Circuits

Python 7 Updated Jan 16, 2025
Python 3 Updated Feb 3, 2025

Tensor Network Learning with PyTorch

Python 302 44 Updated May 23, 2024

A computer algebra system written in pure Python

Python 14,131 4,879 Updated Nov 26, 2025

Code for "TabZilla: When Do Neural Nets Outperform Boosted Trees on Tabular Data?"

HTML 173 37 Updated Mar 22, 2024

Official implementation of E(n)-equivariant Graph Neural Cellular Automata

Jupyter Notebook 35 5 Updated Oct 3, 2025

A New Modeling Framework for Continuous, Sequential Domains

Jupyter Notebook 2 1 Updated Jun 16, 2024

Code release for Hoogeboom, Emiel, Jorn WT Peters, Rianne van den Berg, and Max Welling. "Integer Discrete Flows and Lossless Compression." Conference on Neural Information Processing Systems (2019).

Python 100 15 Updated Nov 29, 2019

LLM training in simple, raw C/CUDA

Cuda 28,254 3,297 Updated Jun 26, 2025
Next