View tosterberg's full-sized avatar


Direct File

JavaScript 4,447 1,346 Updated Jun 5, 2025

Architectural Metapatterns book and wiki

Python 664 53 Updated Nov 8, 2025

Flexible and powerful framework for managing multiple AI agents and handling complex conversations

Python 7,044 643 Updated Oct 21, 2025

🙌 OpenHands: Code Less, Make More

Python 64,836 7,877 Updated Nov 9, 2025

Literature references for “Designing Data-Intensive Applications”

6,668 854 Updated Oct 29, 2025

A curated list of software and architecture related design patterns.

44,716 3,146 Updated Oct 25, 2024

Learn Low Level Design (LLD) and prepare for interviews using free resources.

Java 19,252 4,771 Updated Oct 25, 2025

Curated Data Science resources (Free & Paid) to help aspiring and experienced data scientists learn, grow, and advance their careers.

1,296 135 Updated Sep 26, 2025

A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and fully reproducible.

Python 625 36 Updated Mar 23, 2025

AWS CDK Builder is a browser-based tool designed to streamline bootstrapping of Infrastructure as Code (IaC) projects using the AWS Cloud Development Kit (CDK).

TypeScript 204 10 Updated Aug 1, 2024

SQL tips and tricks

SQL 2,246 99 Updated Oct 10, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 20,072 3,315 Updated Nov 10, 2025

An Engine-Agnostic Deep Learning Framework in Java

Java 1 Updated Jan 10, 2025

Demo applications showcasing DJL

Jupyter Notebook 1 Updated Jul 23, 2024

A universal scalable machine learning model deployment solution

Java 1 Updated Oct 30, 2024

Open-source Android/Desktop remake of Civ V

Kotlin 9,745 1,755 Updated Nov 8, 2025

Multiplayer top-down shooter made from scratch in C++. Play in your Browser! https://hypersomnia.io Made in 🇵🇱

C++ 1,354 75 Updated Nov 2, 2025

Procedural tree generator written with JavaScript and Three.js

JavaScript 799 98 Updated Jan 14, 2025

A library for training and deploying machine learning models on Amazon SageMaker

Python 2,204 1,201 Updated Nov 7, 2025

Tools to Design or Visualize Architecture of Neural Network

5,148 625 Updated Aug 1, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 21 13 Updated Nov 7, 2025

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 1,981 221 Updated Nov 5, 2025

Efficient platform for inference and serving of local LLMs, including an OpenAI-compatible API server.

Rust 514 57 Updated Nov 4, 2025

This is an online course where you can learn and master the skill of low-level performance analysis and tuning.

C++ 3,328 326 Updated Nov 9, 2025

Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.

Jupyter Notebook 246 88 Updated Nov 7, 2025

[EMNLP'23, ACL'24] To speed up LLM inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Python 5,556 329 Updated Oct 28, 2025

Official NetHack Git Repository

C 3,368 513 Updated Nov 10, 2025

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,297 78 Updated Mar 6, 2025