icwhite
🎯 Focusing


Guide for fine-tuning Llama/Mistral/CodeLlama models and more

Python 644 103 Updated Oct 15, 2025

Open Source Model Context Protocol server for PowerPoint automation on Windows via pywin32

Python 21 5 Updated Dec 29, 2025

Building modular LMs with parameter-efficient fine-tuning.

Python 114 22 Updated Oct 19, 2025

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 344 33 Updated Dec 23, 2025

[AAAI 2026] Open-Source LLM-Based Data Analysis Agents

Python 60 4 Updated Nov 10, 2025

Post-training with Tinker

Python 2,694 288 Updated Jan 7, 2026

Democratizing Reinforcement Learning for LLMs

Python 4,955 476 Updated Jan 6, 2026
Python 53 6 Updated Oct 10, 2024

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 8,741 845 Updated Jan 6, 2026

Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks"

Python 254 11 Updated May 5, 2025

Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)

Python 159 15 Updated Oct 30, 2024

Project Malmo is a platform for Artificial Intelligence experimentation and research built on top of Minecraft. We aim to inspire a new generation of research into challenging new problems presente…

Java 4,239 610 Updated Sep 3, 2025

Minecraft AI with LLMs+Mineflayer

JavaScript 4,603 634 Updated Dec 29, 2025

LLaMA-Factory adaptation for LLM Minecraft agents

Python 1 Updated Jun 1, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,426 212 Updated Jan 7, 2026
JavaScript 2 1 Updated Nov 3, 2025

A Text-Based Environment for Interactive Debugging

Python 287 37 Updated Jan 7, 2026

Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.

Jupyter Notebook 23 6 Updated Oct 27, 2025
Python 58 17 Updated Sep 18, 2025

Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.

TypeScript 1,942 76 Updated Jan 1, 2026

Scripts for serving vLLM on multiple nodes

Python 1 Updated Feb 24, 2025

DialOp: Decision-oriented dialogue environments for collaborative language agents

Python 111 8 Updated Nov 15, 2024

Framework and toolkits for building and evaluating collaborative agents that can work together with humans.

Python 117 17 Updated Dec 4, 2025
Python 116 6 Updated Jan 21, 2025
HTML 1 Updated Dec 22, 2025

🐟 A simple theme for Jekyll. Live at https://eliottvincent.github.io/bay/

HTML 179 407 Updated Sep 17, 2025

An Open-Ended Agentic Simulator

Python 58 7 Updated Aug 11, 2024

Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs

Python 103 24 Updated Sep 30, 2025

Code for RSA+C3 paper

Python 4 Updated Aug 12, 2024