Fascinated with many things, such as:
Artificial Intelligence, Physics, Machine Learning and Reinforcement Learning
Pinned
- distillation-after-training (Public, Python): Official PyTorch implementation of "How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning" (NeurIPS 2025).
- lever-game (Public): Implements the Lever Coordination Game and shows that the other-play learning algorithm outperforms basic self-play and league-play agents in the zero-shot coordination scenario.