histmeisah

ma wei yu histmeisah

sc2 fans

26 followers · 8 following

UCAS
Beijing ,China
@meisah111

Achievements

Lists (3)

Sort

Stars

OpenHelix-Team / Awesome-VLA-RL

This repository summarizes recent advances in the VLA + RL paradigm and provides a taxonomic classification of relevant works.

325 4 Updated Oct 10, 2025

ZibinDong / openpi_pytorch

Pytorch PI-zero and PI-zero-fast. Adapted from LeRobot

Python 145 11 Updated Sep 2, 2025

traceroot-ai / traceroot-sdk

Python SDK for TraceRoot

Python 36 2 Updated Nov 12, 2025

traceroot-ai / traceroot

Find the Root Cause in Your Code's Trace

TypeScript 353 48 Updated Nov 6, 2025

MoonshotAI / Kimi-K2

Kimi K2 is the large language model series developed by Moonshot AI team

9,394 653 Updated Nov 7, 2025

aibox22 / readmeX

An AI Powered README and Interactive Wiki Generator for Any Projects. AI驱动的README及交互式Wiki生成工具，面向中文的开源DeepWiki。

Python 377 22 Updated Aug 1, 2025

devindeng94 / LLM-SMAC

A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models

Python 49 8 Updated Apr 1, 2025

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,305 148 Updated Nov 14, 2025

InternLM / InternBootcamp

Python 323 24 Updated Aug 29, 2025

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,176 101 Updated Oct 20, 2025

ByteDance-Seed / VeOmni

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,296 95 Updated Nov 14, 2025

Agent-RL / ReCall

ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 1,243 76 Updated May 16, 2025

histmeisah / Search_r1_multinode

Multinode reimplement for search-r1

Python 4 1 Updated Oct 6, 2025

HyperONE27 / replay_fix

Python 5 1 Updated Jun 4, 2025

camel-ai / VLM-Play-StarCraft2

Python 31 2 Updated Mar 10, 2025

plm-team / PLM

PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing

20 3 Updated Mar 18, 2025

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,506 297 Updated Nov 13, 2025

whcpumpkin / auto-vv-machine

Python 178 6 Updated Mar 13, 2025

venera-app / venera

A comic app

Dart 5,107 162 Updated Nov 2, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,688 2,529 Updated Nov 14, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 20,161 3,391 Updated Nov 15, 2025

AI-for-starcraft2 / mod-for-starcraft2

HTML 2 1 Updated Nov 18, 2021

ryanluoli1 / Machine-Learning-Books-and-Papers

A collection of the books and papers on data science and machine learning.

57 8 Updated Jan 30, 2023

devindeng94 / smac-hard

Enabling Mixed Opponent Strategy Script and Self-play on SMAC

Python 38 4 Updated Jul 24, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,381 813 Updated Nov 9, 2025

Blizzard / s2protocol

Python library to decode StarCraft II replay protocols

Python 633 113 Updated Oct 8, 2025

toncula / DI-star

Forked from opendilab/DI-star

OpenDILab Decision AI in StarCraftII

Python 3 1 Updated Jun 27, 2024

ma wei yu histmeisah

Lists (3)

llm

RL

starcraft2

Stars