Stars
Implementation for the paper: Reward Learning from Multiple Feedback Types (ICLR 2025)
Simple single-file baselines for Q-Learning in a pure-GPU setting
Skeleton for scalable and flexible JAX RL implementations
Long-Term Evolution Project of Reinforcement Learning
Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments