Skip to content
View TensorTemplar's full-sized avatar

Block or report TensorTemplar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MoE training for Me and You and maybe other people

Python 303 26 Updated Dec 17, 2025

Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)

Python 462 53 Updated Dec 26, 2025

A fancy self-hosted monitoring tool

JavaScript 80,259 7,160 Updated Dec 26, 2025

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 1,301 180 Updated Dec 17, 2025

RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots

Python 1,048 129 Updated Dec 18, 2025

MiniMax-M2, a model built for Max coding & agentic workflows.

2,162 166 Updated Nov 13, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,690 756 Updated Dec 28, 2025

The best ChatGPT that $100 can buy.

Python 39,371 5,006 Updated Dec 28, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,176 195 Updated Oct 9, 2025

SQL databases in Python, designed for simplicity, compatibility, and robustness.

Python 17,400 791 Updated Dec 26, 2025

Official inference repo for FLUX.1 models

Python 24,971 1,828 Updated Jul 31, 2025

Post-training with Tinker

Python 2,627 266 Updated Dec 28, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,836 1,039 Updated Dec 24, 2025

Code implementation for the paper "Large-scale Pre-training for Grounded Video Caption Generation" (ICCV 2025)

Python 26 Updated Nov 9, 2025

Code and training scripts for FlexOlmo

Python 120 16 Updated Dec 18, 2025

Material for gpu-mode lectures

Jupyter Notebook 5,464 554 Updated Dec 8, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 75,790 2,387 Updated Dec 28, 2025

Development repository for the Triton language and compiler

MLIR 17,959 2,473 Updated Dec 28, 2025

Renderer for the harmony response format to be used with gpt-oss

Rust 4,099 239 Updated Dec 15, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,011 784 Updated Dec 23, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 12,753 1,272 Updated Oct 28, 2025

Standard Open Arm 100

5,066 425 Updated Dec 9, 2025

NanoGPT (124M) in 3 minutes

Python 4,035 534 Updated Dec 27, 2025

A PyTorch native platform for training generative AI models

Python 4,878 652 Updated Dec 27, 2025

MCP server that enables AI assistants to interact with Linear project management system through natural language, allowing users to retrieve, create, and update issues, projects, and teams.

TypeScript 121 25 Updated Sep 5, 2025
Python 36 8 Updated Aug 20, 2025
Python 1,513 220 Updated Jun 26, 2025
Python 91 13 Updated Dec 19, 2025

QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.

Python 26 3 Updated Dec 17, 2025
Next