Skip to content
View tugot17's full-sized avatar

Block or report tugot17

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 949 78 Updated Sep 4, 2024

Simplifying reinforcement learning for complex game environments

C 4,228 306 Updated Nov 15, 2025
Python 10 1 Updated Oct 17, 2025

Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/

Python 1,640 80 Updated Nov 15, 2025

Solve puzzles. Improve your pytorch.

Jupyter Notebook 3,775 341 Updated Jul 15, 2024

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 301 28 Updated Nov 14, 2025

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 744 169 Updated Nov 14, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,768 99 Updated Mar 18, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,179 1,304 Updated Nov 15, 2025

My learning notes/codes for ML SYS.

Python 4,172 253 Updated Nov 14, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,201 169 Updated Nov 16, 2025

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 310 75 Updated Oct 29, 2025

Optimize prompts, code, and more with AI-powered Reflective Text Evolution

Jupyter Notebook 1,568 115 Updated Nov 12, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,486 259 Updated Nov 16, 2025

Async RL Training at Scale

Python 770 134 Updated Nov 16, 2025

Real-time terminal monitor for InfiniBand networks - htop for high-speed interconnects

Rust 45 1 Updated Sep 3, 2025

Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.

Python 1,340 128 Updated Oct 22, 2025
Python 916 97 Updated Nov 14, 2025

Training-Ready RL Environments + Evals

Python 175 189 Updated Nov 16, 2025

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Python 616 68 Updated Oct 16, 2025

A PyTorch native platform for training generative AI models

Python 4,713 603 Updated Nov 16, 2025

Automatic evals for LLMs

HTML 557 68 Updated Jun 27, 2025

Environments for LLM Reinforcement Learning

Python 3,487 431 Updated Nov 16, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,758 2,537 Updated Nov 15, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 18 Updated Nov 14, 2025

Qwen Code is a coding agent that lives in the digital world.

TypeScript 15,475 1,281 Updated Nov 14, 2025

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 31,123 4,716 Updated Nov 16, 2025
Next