Enabling constructive dialogue.

Dart 44 8 Updated Nov 14, 2025

utilities for decoding deep representations (like sentence embeddings) back to text

Python 987 110 Updated Aug 5, 2025

Extract full next-token probabilities via language model APIs

Python 247 13 Updated Feb 23, 2024

The Nomyx game

Dockerfile 84 9 Updated Oct 22, 2023

Governance of the Commons Simulation (GovSim)

Python 1 1 Updated Nov 21, 2024
Python 2 Updated Jul 17, 2024

Generative Agents: Interactive Simulacra of Human Behavior

Python 78 29 Updated Jul 10, 2025

course homepage for Introduction to Machine Learning

Jupyter Notebook 34 7 Updated Dec 14, 2024

Dataset to propose for TidyTuesday

R 3 1 Updated Oct 26, 2021

A lightweight research framework

Python 28 4 Updated Oct 14, 2025

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 3,091 282 Updated Jun 4, 2024

Code and data for our IROS paper: "Are Large Language Models Aligned with People's Social Intuitions for Human–Robot Interactions?"

Jupyter Notebook 4 3 Updated Mar 20, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,393 815 Updated Nov 9, 2025

An index of algorithms for reinforcement learning from human feedback (RLHF)

92 3 Updated Apr 17, 2024

Language model alignment-focused deep learning curriculum

1,492 119 Updated Aug 19, 2024

An API conversion tool for popular external reinforcement learning environments

Python 191 24 Updated Oct 28, 2025

creating agents with normative reasoning ability

Jupyter Notebook 2 Updated Jun 16, 2025

A simple altar game based on Phaser3

JavaScript 1 Updated Jan 15, 2024

Maximum diversity problem solver in Python using a genetic algorithm

Python 2 Updated Nov 28, 2022

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,761 350 Updated Jul 18, 2024

A Python library for dynamic classifier and ensemble selection

Python 495 109 Updated Apr 15, 2024

A library for generative social simulation

Python 1,076 229 Updated Nov 7, 2025

Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"

Python 1,646 268 Updated Jul 21, 2023

Official implementation of "Multi-Task Learning as a Bargaining Game" [ICML 2022]

Python 233 28 Updated Jun 25, 2025

Harvard Joint CS + Government Thesis Project 2018-2019: Escaping the State of Nature

Python 5 Updated Apr 29, 2019

hanabi_learning_environment is a research platform for Hanabi experiments.

Python 655 163 Updated Feb 14, 2023

This is a suite of reinforcement learning environments illustrating various safety properties of intelligent agents.

Python 617 127 Updated May 18, 2022