Piotr Mazurek tugot17

Making LLMs go brrr @Aleph__Alpha

130 followers · 51 following

Stars

IST-DASLab / marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 949 78 Updated Sep 4, 2024

PufferAI / PufferLib

Simplifying reinforcement learning for complex game environments

C 4,228 306 Updated Nov 15, 2025

brendanhogan / nano-grpo-envs

Python 10 1 Updated Oct 17, 2025

patrick-kidger / jaxtyping

Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/

Python 1,640 80 Updated Nov 15, 2025

srush / Tensor-Puzzles

Solve puzzles. Improve your pytorch.

Jupyter Notebook 3,775 341 Updated Jul 15, 2024

ServiceNow / PipelineRL

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 301 28 Updated Nov 14, 2025

NousResearch / atropos

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 744 169 Updated Nov 14, 2025

deepseek-ai / DeepSeek-V3.2-Exp

Python 979 71 Updated Oct 2, 2025

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python 1,768 99 Updated Mar 18, 2025

alexarmbr / matmul-playground

Cuda 19 5 Updated Apr 7, 2025

thinking-machines-lab / batch_invariant_ops

Python 897 68 Updated Nov 4, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,179 1,304 Updated Nov 15, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes/codes for ML SYS.

Python 4,172 253 Updated Nov 14, 2025

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,201 169 Updated Nov 16, 2025

LeonGuertler / TextArena

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 310 75 Updated Oct 29, 2025

gepa-ai / gepa

Optimize prompts, code, and more with AI-powered Reflective Text Evolution

Jupyter Notebook 1,568 115 Updated Nov 12, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 2,486 259 Updated Nov 16, 2025

PrimeIntellect-ai / prime-rl

Async RL Training at Scale

Python 770 134 Updated Nov 16, 2025

JannikSt / ibtop

Real-time terminal monitor for InfiniBand networks - htop for high-speed interconnects

Rust 45 1 Updated Sep 3, 2025

Tencent-Hunyuan / HunyuanWorld-Voyager

Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.

Python 1,340 128 Updated Oct 22, 2025

ChenmienTan / RL2

Python 916 97 Updated Nov 14, 2025

PrimeIntellect-ai / prime-environments

Training-Ready RL Environments + Evals

Python 175 189 Updated Nov 16, 2025

Tencent-Hunyuan / Hunyuan-GameCraft-1.0

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Python 616 68 Updated Oct 16, 2025

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 4,713 603 Updated Nov 16, 2025

mlfoundations / evalchemy

Automatic evals for LLMs

HTML 557 68 Updated Jun 27, 2025

PrimeIntellect-ai / verifiers

Environments for LLM Reinforcement Learning

Python 3,487 431 Updated Nov 16, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,758 2,537 Updated Nov 15, 2025

Aleph-Alpha / vllm

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 18 Updated Nov 14, 2025

QwenLM / qwen-code

Qwen Code is a coding agent that lives in the digital world.

TypeScript 15,475 1,281 Updated Nov 14, 2025

BerriAI / litellm

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 31,123 4,716 Updated Nov 16, 2025