Lists (3)
Sort Name ascending (A-Z)
Stars
This repository summarizes recent advances in the VLA + RL paradigm and provides a taxonomic classification of relevant works.
Pytorch PI-zero and PI-zero-fast. Adapted from LeRobot
Find the Root Cause in Your Code's Trace
Kimi K2 is the large language model series developed by Moonshot AI team
An AI Powered README and Interactive Wiki Generator for Any Projects. AI驱动的README及交互式Wiki生成工具,面向中文的开源DeepWiki。
A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning
PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
verl: Volcano Engine Reinforcement Learning for LLMs
SGLang is a fast serving framework for large language models and vision language models.
A collection of the books and papers on data science and machine learning.
Enabling Mixed Opponent Strategy Script and Self-play on SMAC
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
Python library to decode StarCraft II replay protocols
toncula / DI-star
Forked from opendilab/DI-starOpenDILab Decision AI in StarCraftII
This project contains various scripts that can assist in the process of preparing datasets.
GPT4 powered AI coach to help you play on SC2 ladder
Clustering for SCII build orders in patch >=5.0.3 based on the spawningtool and sc2reader replay parsers.