Skip to content
View hijkzzz's full-sized avatar

Block or report hijkzzz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Async RL Training at Scale

Python 859 143 Updated Nov 29, 2025

"AI-Trader: Can AI Beat the Market?" Live Trading Bench: https://ai4trade.ai

Python 9,707 1,525 Updated Nov 26, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,148 1,384 Updated Nov 14, 2025

Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

Python 190 25 Updated Nov 29, 2025

PyTorch native quantization and sparsity for training and inference

Python 2,540 376 Updated Nov 27, 2025
Python 842 45 Updated Sep 15, 2025

A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience

TypeScript 84,733 6,255 Updated Nov 29, 2025

WFGY 2.0. Semantic Reasoning Engine for LLMs (MIT). Fixes RAG/OCR drift, collapse & “ghost matches” via symbolic overlays + logic patches. Autoboot; OneLine & Flagship. ⭐ Star if you explore semant…

Python 1,266 106 Updated Oct 14, 2025

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 503 31 Updated Nov 26, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,287 294 Updated Nov 27, 2025

ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)

Python 254 27 Updated Nov 27, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,315 1,948 Updated Nov 1, 2025

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 165 34 Updated Nov 27, 2025

(best/better) practices of megatron on veRL and tuning guide

Shell 103 8 Updated Sep 26, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 43,881 3,017 Updated Nov 27, 2025

A Gym for Agentic LLMs

Python 368 25 Updated Nov 10, 2025

End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Python 330 18 Updated Sep 22, 2025

Train your Agent model via our easy and efficient framework

Python 1,634 155 Updated Nov 17, 2025

An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.

Python 15,806 2,500 Updated Nov 27, 2025

[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Python 198 38 Updated Jul 13, 2025

Scaling RL on advanced reasoning models

Python 640 40 Updated Oct 20, 2025

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

Python 2,999 264 Updated Jul 7, 2025

Nano vLLM

Python 9,353 1,153 Updated Nov 3, 2025

Custom FlashAttention 2.7.4.post1 wheels for PyTorch 2.7.0+cu128 (CUDA 12.8) on Linux x86_64

4 Updated May 8, 2025
Python 27 6 Updated Nov 26, 2025
Python 344 20 Updated Jul 29, 2025

A live stream development of RL tunning for LLM agents

Python 3,633 505 Updated Oct 8, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 64,213 11,633 Updated Nov 29, 2025
Python 86 2 Updated Aug 16, 2025
Next