Minimax-Value-Interval Public
Code for paper "Minimax Value Interval for Off-Policy Evaluation and Policy Optimization".
Python, updated Oct 7, 2020
DR-PG Public
Forked from gtrll/rlfamily_cv
Code for the paper "From Importance Sampling to Doubly Robust Policy Gradient"
lihang-code Public
Forked from fengdu78/lihang-code
Code implementation of the book "Statistical Learning Methods" (《统计学习方法》)
Jupyter Notebook, updated Dec 17, 2018