-
Adaptation-with-Noisy-OracLE
PyTorch implementation for our paper "Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation"
-
Doubly-Bounded-Q-Learning
TensorFlow implementation for our paper "On the Estimation Bias in Double Q-Learning"
-
TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"
-
Hindsight-Goal-Generation
TensorFlow implementation for our paper "Exploration via Hindsight Goal Generation"