🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 64,559 6,542 Updated Nov 11, 2025

DeepReinforcementLearning / DeepReinforcementLearningInAction

Code from the Deep Reinforcement Learning in Action book from Manning, Inc

Jupyter Notebook 820 345 Updated Apr 22, 2024

LantaoYu / MARL-Papers

Paper list of multi-agent reinforcement learning (MARL)

4,605 764 Updated Nov 19, 2025

hehonghui / awesome-english-ebooks

经济学人(含音频)、纽约客、卫报、连线、大西洋月刊等英语杂志免费下载,支持epub、mobi、pdf格式, 每周更新

CSS 26,842 2,147 Updated Nov 21, 2025

johnjim0816 / joyrl-offline

Python 66 27 Updated Dec 27, 2023

johnjim0816 / joyrl-book

Jupyter Notebook 17 4 Updated Dec 20, 2023

PKUFlyingPig / cs-self-learning

计算机自学指南

HTML 69,556 7,741 Updated Nov 14, 2025

boyu-ai / Hands-on-RL

https://hrl.boyuai.com/

Jupyter Notebook 4,229 769 Updated Nov 22, 2022

dennybritz / reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 21,737 6,169 Updated Jul 13, 2023

tinyzqh / light_mappo

Lightweight version of MAPPO to help you quickly migrate to your local environment.

Python 764 103 Updated Oct 23, 2025

alexanderbaumann99 / PPO-Algorithms

Experiments of the three PPO-Algorithms (PPO, clipped PPO, PPO with KL-penalty) proposed by John Schulman et al. on the 'Cartpole-v1' environment.

Python 13 1 Updated Nov 14, 2021

datawhalechina / rl-papers

rl-papers

49 9 Updated Mar 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cloris clorisqiu1

Achievements

Achievements

Block or report clorisqiu1

Lists (1)

✨ Inspiration

Stars

georgewanglz2019 / MARL4TA

tinyzqh / OptiVerse

nelvko / clash-for-linux-install

datawhalechina / happy-llm

codecaution / Awesome-Mixture-of-Experts-Papers

XueFuzhao / awesome-mixture-of-experts

lucidrains / vit-pytorch

datawhalechina / wow-agent

datawhalechina / wow-rag

rlcode / per

xuehy / pytorch-maddpg

PufferAI / PufferLib

JasonZhujp / RL-Crowdsourcing

SafeRL-Lab / LLM-RL-Robotics-Papers

datawhalechina / whale-coin

datawhalechina / wow-fullstack

Engineer1999 / Double-Deep-Q-Learning-for-Resource-Allocation

openai / maddpg

labmlai / annotated_deep_learning_paper_implementations