Stars
Guide for fine-tuning Llama/Mistral/CodeLlama models and more
Open Source Model Context Protocol server for PowerPoint automation on Windows via pywin32
Building modular LMs with parameter-efficient fine-tuning.
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
[AAAI 2026] Open-Source LLM-Based Data Analysis Agents
Post-training with Tinker
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks
Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)
Project Malmo is a platform for Artificial Intelligence experimentation and research built on top of Minecraft. We aim to inspire a new generation of research into challenging new problems presente…
Minecraft AI with LLMs+Mineflayer
hlillemark / LLaMA-Factory-mc
Forked from hiyouga/LlamaFactoryLlama factory adaptation for llm minecraft agents
SkyRL: A Modular Full-stack RL Library for LLMs
A Text-Based Environment for Interactive Debugging
Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.
Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.
DialOp: Decision-oriented dialogue environments for collaborative language agents
Framework and toolkits for building and evaluating collaborative agents that can work together with humans.
🐟 A simple theme for Jekyll. Live at https://eliottvincent.github.io/bay/
Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs