Reinforcing Profits in Hydroelectric Energy Markets: A Comparison of Tabular and Double Deep Q-Network Reinforcement-Learning Approaches

Project RL Assignment

Group 15

Michal Butkiewicz, Sunny Soni, Andrzej Szczepura

Steps for setting up the environment:

For Anaconda, run `conda env create -f environment.yml` in the main directory.

For a vanilla Python virtual environment (or a plain installation), you can instead run `pip install -r requirements.txt`.

File Descriptions:

- Agent_Final.py : our self-designed gym environment (see the interaction sketch after this list)
- preprocess.py : code for preprocessing the xlsx files for our environment
- DDQN_Agent.py : the DDQN-based RL agent
- ddqn_train.py : code for training and validating our DDQN agent on our custom environment
- tabular_qlearning.py : the tabular Q-learning RL agent, plus code for training and testing the models
- random_baseline.py : code for testing an agent that takes random actions on the validation set
- exploratory_data_analysis.ipynb : EDA notebook
- Plots.ipynb : notebook with the code used to plot the graphs in our report
- ddqn_plot_v_values.py : code for plotting the V-value plots
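Since Agent_Final.py exposes the environment through the usual gym interface, a rollout looks roughly like the sketch below. This is a hedged illustration only: the class name HydroElectric_Env, its constructor argument, and the classic 4-tuple step signature are assumptions, not verified against the code.

```python
from Agent_Final import HydroElectric_Env  # assumed class name

env = HydroElectric_Env('data/train.xlsx')  # assumed constructor argument

state = env.reset()
done = False
total_reward = 0.0
while not done:
    action = env.action_space.sample()            # random action, as in random_baseline.py
    state, reward, done, info = env.step(action)  # classic gym 4-tuple step assumed
    total_reward += reward
print(f'Episode profit: {total_reward:.2f}')
```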

Experiments:

In all the experiment files, you can change the mode variable to 'train' to train your own models using the custom agent.
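As a minimal illustration of that switch (the exact placement of the variable, and which other values each file accepts, are per-file details not reproduced here):

```python
# Near the top of an experiment file (e.g. ddqn_train.py or tabular_qlearning.py):
mode = 'train'   # 'train' fits new models; see each file for the other accepted values
```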

For DDQN:

  1. Use ddqn_train.py for training/validation.
  2. The configurable parameters are documented in the class definition in DDQN_Agent.py. (A sketch of the general Double DQN update follows this list.)
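For context, the defining ingredient of a Double DQN agent is the decoupled bootstrap target below: the online network selects the next action and the target network evaluates it, which reduces Q-value overestimation. This is the textbook rule sketched in PyTorch, not code copied from DDQN_Agent.py; the function and parameter names are ours.

```python
import torch

def ddqn_target(reward, next_state, done, online_net, target_net, gamma=0.99):
    """Double DQN bootstrap target: r + gamma * Q_target(s', argmax_a Q_online(s', a))."""
    with torch.no_grad():
        best_action = online_net(next_state).argmax(dim=1, keepdim=True)   # action selection: online net
        next_q = target_net(next_state).gather(1, best_action).squeeze(1)  # action evaluation: target net
        return reward + gamma * next_q * (1.0 - done)                      # no bootstrapping past terminals
```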

For Tabular:

  1. Use tabular_qlearning.py for training and validation. (The standard Q-learning update it builds on is sketched after this list.)
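The agent in tabular_qlearning.py is built on the standard tabular Q-learning rule; a minimal version is sketched below. The discretisation sizes and hyperparameters are placeholders, not the ones used in the repository.

```python
import numpy as np

n_states, n_actions = 100, 3   # placeholder discretisation sizes
alpha, gamma = 0.1, 0.99       # placeholder learning rate and discount factor

Q = np.zeros((n_states, n_actions))

def q_update(s, a, r, s_next):
    # Q(s, a) <- Q(s, a) + alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))
    td_target = r + gamma * Q[s_next].max()
    Q[s, a] += alpha * (td_target - Q[s, a])
```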

Random baseline:

  1. Use random_baseline.py; the results are printed to the console.

Validation on the standard environment:

main.py uses our pretrained DDQN agent (trained with the basic set) to run on the validation set provided in validate.xlsx. The model it loads ships in the same directory and should work out of the box.
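At validation time the agent acts greedily with respect to the learned Q-network; a rough sketch of that step is shown below. The architecture, dimensions, and checkpoint filename are stand-ins, since none of them are specified here.

```python
import torch
import torch.nn as nn

# Stand-in architecture for the Q-network in DDQN_Agent.py (dimensions assumed).
q_net = nn.Sequential(nn.Linear(4, 64), nn.ReLU(), nn.Linear(64, 3))
# q_net.load_state_dict(torch.load('ddqn_model.pt'))  # assumed checkpoint filename
q_net.eval()

state = torch.zeros(4)                   # placeholder observation
with torch.no_grad():
    action = int(q_net(state).argmax())  # greedy action: no exploration during validation
```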

Model and data directories

If the model and data directories do not exist, validation will fail. In that case, train your models first before trying to validate on the custom environment.
