Skip to content
View histmeisah's full-sized avatar

Block or report histmeisah

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This repository summarizes recent advances in the VLA + RL paradigm and provides a taxonomic classification of relevant works.

325 4 Updated Oct 10, 2025

Pytorch PI-zero and PI-zero-fast. Adapted from LeRobot

Python 145 11 Updated Sep 2, 2025

Python SDK for TraceRoot

Python 36 2 Updated Nov 12, 2025

Find the Root Cause in Your Code's Trace

TypeScript 353 48 Updated Nov 6, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

9,394 653 Updated Nov 7, 2025

An AI Powered README and Interactive Wiki Generator for Any Projects. AI驱动的README及交互式Wiki生成工具,面向中文的开源DeepWiki。

Python 377 22 Updated Aug 1, 2025

A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models

Python 49 8 Updated Apr 1, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,305 148 Updated Nov 14, 2025
Python 323 24 Updated Aug 29, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,176 101 Updated Oct 20, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,296 95 Updated Nov 14, 2025

ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 1,243 76 Updated May 16, 2025

Multinode reimplement for search-r1

Python 4 1 Updated Oct 6, 2025
Python 5 1 Updated Jun 4, 2025
Python 31 2 Updated Mar 10, 2025

PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing

20 3 Updated Mar 18, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,506 297 Updated Nov 13, 2025
Python 178 6 Updated Mar 13, 2025

A comic app

Dart 5,107 162 Updated Nov 2, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,688 2,529 Updated Nov 14, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 20,161 3,391 Updated Nov 15, 2025

A collection of the books and papers on data science and machine learning.

57 8 Updated Jan 30, 2023

Enabling Mixed Opponent Strategy Script and Self-play on SMAC

Python 38 4 Updated Jul 24, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,381 813 Updated Nov 9, 2025

Python library to decode StarCraft II replay protocols

Python 633 113 Updated Oct 8, 2025

OpenDILab Decision AI in StarCraftII

Python 3 1 Updated Jun 27, 2024

This project contains various scripts that can assist in the process of preparing datasets.

Python 3 3 Updated May 27, 2025

GPT4 powered AI coach to help you play on SC2 ladder

Jupyter Notebook 6 2 Updated Sep 29, 2025

Clustering for SCII build orders in patch >=5.0.3 based on the spawningtool and sc2reader replay parsers.

Jupyter Notebook 2 Updated May 5, 2021
Next