GitHub - XinJingHao/DRL-Pytorch: Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

Clean, Robust, and Unified PyTorch implementation of popular DRL Algorithms

0.Star History

1.Dependencies

This repository uses the following python dependencies unless explicitly stated:

gymnasium==0.29.1
numpy==1.26.1
pytorch==2.1.0

python==3.11.5

2.How to use my code

Enter the folder of the algorithm that you want to use, and run the main.py to train from scratch:

python main.py

For more details, please check the README.md file in the corresponding algorithm folder.

3. Separate links of the code

4. Recommended Resources for DRL

4.1 Simulation Environments:

gym and gymnasium (Lightweight & Standard Env for DRL; Easy to start; Slow):

Isaac Sim (NVIDIA’s physics simulation environment; GPU accelerated; Superfast):

Sparrow (Light Weight Simulator for Mobile Robot; DRL friendly):

ROS (Popular & Comprehensive physical simulator for robots; Heavy and Slow):

Webots (Popular physical simulator for robots; Faster than ROS; Less realistic):

Envpool (Fast Vectorized Env)
Other Popular Envs

4.2 Books：

《Reinforcement learning: An introduction》--Richard S. Sutton
《深度学习入门：基于Python的理论与实现》--斋藤康毅

4.3 Online Courses:

RL Courses(bilibili)--李宏毅(Hongyi Li)
RL Courses(Youtube)--李宏毅(Hongyi Li)
UCL Course on RL--David Silver
动手强化学习--上海交通大学
DRL Courses--Shusen Wang

4.4 Blogs:

5. Important Papers

DQN: Mnih V, Kavukcuoglu K, Silver D, et al. Human-level control through deep reinforcement learning[J]. nature, 2015, 518(7540): 529-533.

Double DQN: Van Hasselt H, Guez A, Silver D. Deep reinforcement learning with double q-learning[C]//Proceedings of the AAAI conference on artificial intelligence. 2016, 30(1).

Duel DQN: Wang, Ziyu, et al. "Dueling network architectures for deep reinforcement learning." International conference on machine learning. PMLR, 2016.

PER: Schaul T, Quan J, Antonoglou I, et al. Prioritized experience replay[J]. arXiv preprint arXiv:1511.05952, 2015.

C51: Bellemare M G, Dabney W, Munos R. A distributional perspective on reinforcement learning[C]//International conference on machine learning. PMLR, 2017: 449-458.

NoisyNet DQN: Fortunato M, Azar M G, Piot B, et al. Noisy networks for exploration[J]. arXiv preprint arXiv:1706.10295, 2017.

PPO: Schulman J, Wolski F, Dhariwal P, et al. Proximal policy optimization algorithms[J]. arXiv preprint arXiv:1707.06347, 2017.

DDPG: Lillicrap T P, Hunt J J, Pritzel A, et al. Continuous control with deep reinforcement learning[J]. arXiv preprint arXiv:1509.02971, 2015.

TD3: Fujimoto S, Hoof H, Meger D. Addressing function approximation error in actor-critic methods[C]//International conference on machine learning. PMLR, 2018: 1587-1596.

SAC: Haarnoja T, Zhou A, Abbeel P, et al. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor[C]//International conference on machine learning. PMLR, 2018: 1861-1870.

ASL: Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity

ColorDynamic: Generalizable, Scalable, Real-time, End-to-end Local Planner for Unstructured and Dynamic Environments

6. Citation

@misc{DRL-Pytorch,
  author = {Jinghao Xin},
  title = {DRL-Pytorch},
  year = {2022},
  publisher = {GitHub},
  journal = {GitHub Repository},
  howpublished = {\url{https://github.com/XinJingHao/DRL-Pytorch}},
}

7. Training Curves of my Code:

Q-learning:

Duel Double DQN:

CartPole	LunarLander

Noisy Duel DDQN on Atari Game:

Pong	Enduro

Prioritized DQN/DDQN:

CartPole	LunarLander

Categorical DQN:

CartPole	LunarLander

NoisyNet DQN:

CartPole	LunarLander

PPO Discrete:

PPO Continuous:

DDPG:

Pendulum	LunarLanderContinuous

Name		Name	Last commit message	Last commit date
Latest commit History 115 Commits
1.Q-learning		1.Q-learning
2.1_Duel-Double-DQN		2.1_Duel-Double-DQN
2.2_Noisy-Duel-DDQN-Atari		2.2_Noisy-Duel-DDQN-Atari
2.3 Prioritized-Experience-Replay-DDQN-DQN		2.3 Prioritized-Experience-Replay-DDQN-DQN
2.4_Categorical-DQN_C51		2.4_Categorical-DQN_C51
2.5_NoisyNet-DQN		2.5_NoisyNet-DQN
3.1 PPO-Discrete		3.1 PPO-Discrete
3.2 PPO-Continuous		3.2 PPO-Continuous
4.1 DDPG		4.1 DDPG
4.2 TD3		4.2 TD3
5.1 SAC-Discrete		5.1 SAC-Discrete
5.2 SAC-Continuous		5.2 SAC-Continuous
6. Actor-Sharer-Learner		6. Actor-Sharer-Learner
README.md		README.md
RL_PYTORCH.png		RL_PYTORCH.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

0.Star History

1.Dependencies

2.How to use my code

3. Separate links of the code

4. Recommended Resources for DRL

4.1 Simulation Environments:

4.2 Books：

4.3 Online Courses:

4.4 Blogs:

5. Important Papers

6. Citation

7. Training Curves of my Code:

Q-learning:

Duel Double DQN:

Noisy Duel DDQN on Atari Game:

Prioritized DQN/DDQN:

Categorical DQN:

NoisyNet DQN:

PPO Discrete:

PPO Continuous:

DDPG:

TD3:

SAC Continuous:

SAC Discrete:

Actor-Sharer-Learner (ASL):

About

Uh oh!

Releases

Packages

Languages

XinJingHao/DRL-Pytorch

Folders and files

Latest commit

History

Repository files navigation

0.Star History

1.Dependencies

2.How to use my code

3. Separate links of the code

4. Recommended Resources for DRL

4.1 Simulation Environments:

4.2 Books：

4.3 Online Courses:

4.4 Blogs:

5. Important Papers

6. Citation

7. Training Curves of my Code:

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Languages