-
JAX implementation of the HL-Gauss loss (from the paper "Stop Regressing: Training Value Functions via Classification for Scalable Deep RL") on top of DRL algorithms.
-
multi-type-feedback Public
Forked from ymetz/multi-type-feedback
Jupyter Notebook MIT License Updated Jul 3, 2025 -
jaxpruner Public
Forked from google-research/jaxpruner
Python Apache License 2.0 Updated Jan 26, 2025 -
dopamine Public
Forked from google/dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Jupyter Notebook Apache License 2.0 Updated Jan 24, 2025 -
drqv2_pytorch Public
Forked from facebookresearch/drqv2
DrQ-v2: Improved Data-Augmented Reinforcement Learning
Python MIT License Updated Jan 11, 2025 -
-
SafeOffPolicy Public
Forked from PKU-Alignment/omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
Python Apache License 2.0 Updated Oct 15, 2024 -
CAL Public
Code accompanying the paper "Off-Policy Primal-Dual Safe Reinforcement Learning"
-
Plan-to-Predict Public
Code accompanying the paper "Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning".
-
MAG Public
Code accompanying the paper "Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinforcement Learning".
-
mbrl-lib Public
Forked from facebookresearch/mbrl-lib
Library for Model Based RL
Python MIT License Updated Oct 6, 2022 -
-
SAC-Lagrangian Public
Forked from ammarhydr/SAC-Lagrangian
PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm
Python Updated Jul 11, 2022 -
mbpo_pytorch Public
Forked from Xingyu-Lin/mbpo_pytorch
A PyTorch replication of the model-based reinforcement learning algorithm MBPO
Python Updated Apr 12, 2022 -
Coordinated-PPO Public
Code accompanying the paper "Coordinated Proximal Policy Optimization"
-
pytorch-mopo Public
Forked from yihaosun1124/pytorch-mopo
Re-implementation of the offline model-based RL algorithm MOPO in PyTorch
Python MIT License Updated Feb 28, 2022 -
fitting-random-labels Public
Forked from pluskid/fitting-random-labels
Example code for the paper "Understanding deep learning requires rethinking generalization"
Python MIT License Updated Jun 2, 2020