icwhite

🎯

Focusing

Isadora White icwhite

🎯

Focusing

5 followers · 1 following

Achievements

Stars

modal-labs / llm-finetuning

Guide for fine-tuning Llama/Mistral/CodeLlama models and more

Python 644 103 Updated Oct 15, 2025

Ayushmaniar / powerpoint-mcp

Open Source Model Context Protocol server for PowerPoint automation on Windows via pywin32

Python 21 5 Updated Dec 29, 2025

microsoft / mttl

Building modular LMs with parameter-efficient fine-tuning.

Python 114 22 Updated Oct 19, 2025

ServiceNow / PipelineRL

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 344 33 Updated Dec 23, 2025

zjunlp / DataMind

[AAAI 2026] Open-Source LLM-Based Data Analysis Agents

Python 60 4 Updated Nov 10, 2025

thinking-machines-lab / tinker-cookbook

Post-training with Tinker

Python 2,694 288 Updated Jan 7, 2026

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 4,955 476 Updated Jan 6, 2026

swt-user / DMPO

Python 53 6 Updated Oct 10, 2024

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 8,741 845 Updated Jan 6, 2026

facebookresearch / sweet_rl

Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks

Python 254 11 Updated May 5, 2025

Yifan-Song793 / ETO

Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)

Python 159 15 Updated Oct 30, 2024

microsoft / malmo

Project Malmo is a platform for Artificial Intelligence experimentation and research built on top of Minecraft. We aim to inspire a new generation of research into challenging new problems presente…

Java 4,239 610 Updated Sep 3, 2025

mindcraft-bots / mindcraft

Minecraft AI with LLMs+Mineflayer

JavaScript 4,603 634 Updated Dec 29, 2025

hlillemark / LLaMA-Factory-mc

Forked from hiyouga/LlamaFactory

Llama factory adaptation for llm minecraft agents

Python 1 Updated Jun 1, 2025

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,426 212 Updated Jan 7, 2026

icwhite / mindcraft

Forked from mindcraft-bots/mindcraft

JavaScript 2 1 Updated Nov 3, 2025

microsoft / debug-gym

A Text-Based Environment for Interactive Debugging

Python 287 37 Updated Jan 7, 2026

microsoft / tale-suite

Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.

Jupyter Notebook 23 6 Updated Oct 27, 2025

databricks / compose-rl

Python 58 17 Updated Sep 18, 2025

tdurieux / anonymous_github

Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.

TypeScript 1,942 76 Updated Jan 1, 2026

bosung / vllm-multi-node

Scripts for serving vllm on multi node

Python 1 Updated Feb 24, 2025

Ayushmaniar / mindcraft_multiagent_task_generation

JavaScript 1 Updated Mar 1, 2025

jlin816 / dialop

DialOp: Decision-oriented dialogue environments for collaborative language agents

Python 111 8 Updated Nov 15, 2024

SALT-NLP / collaborative-gym

Framework and toolkits for building and evaluating collaborative agents that can work together with humans.

Python 117 17 Updated Dec 4, 2025

jwhj / OREO

Python 116 6 Updated Jan 21, 2025

ucsd-nlp / ucsd-nlp.github.io

HTML 1 Updated Dec 22, 2025

eliottvincent / bay

🐟 A simple theme for Jekyll. Live at https://eliottvincent.github.io/bay/

HTML 179 407 Updated Sep 17, 2025

luchris429 / JaxLife

An Open-Ended Agentic Simulator

Python 58 7 Updated Aug 11, 2024

cocacola-lab / MineLand

Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs

Python 103 24 Updated Sep 30, 2025

icwhite / codenames

Code for RSA+C3 paper

Python 4 Updated Aug 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Isadora White icwhite

Achievements

Achievements

Block or report icwhite

Stars

modal-labs / llm-finetuning

Ayushmaniar / powerpoint-mcp

microsoft / mttl

ServiceNow / PipelineRL

zjunlp / DataMind

thinking-machines-lab / tinker-cookbook

rllm-org / rllm

swt-user / DMPO

OpenRLHF / OpenRLHF

facebookresearch / sweet_rl

Yifan-Song793 / ETO

microsoft / malmo

mindcraft-bots / mindcraft

hlillemark / LLaMA-Factory-mc

NovaSky-AI / SkyRL

icwhite / mindcraft

microsoft / debug-gym

microsoft / tale-suite

databricks / compose-rl

tdurieux / anonymous_github

bosung / vllm-multi-node

Ayushmaniar / mindcraft_multiagent_task_generation

jlin816 / dialop

SALT-NLP / collaborative-gym

jwhj / OREO

ucsd-nlp / ucsd-nlp.github.io

eliottvincent / bay

luchris429 / JaxLife

cocacola-lab / MineLand

icwhite / codenames