Lists (1)
Sort Name ascending (A-Z)
Stars
CC-POMCP, "Monte-Carlo Tree Search for Constrained POMDPs (NIPS 2018)"
Efficiently computes derivatives of NumPy code.
A JAX based compiler to make fast vectorized environment out of Storm models.
mklinik / bb-scripts
Forked from squell/bb-scriptsbash scripts for the ELO of RU
Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"
PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and RL
Python and Julia code for interfacing with X-Plane through UDP; similarly to XPlaneConnect, but also works for X-Plane 12.
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
RPM sources for the DisplayLink USB display adapters
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …
A clean and robust Pytorch implementation of PPO on Discrete action space
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
Implementation of the Dec-MCTS algorithm for multi-robot planning. Project for L32 Advanced Robotics course
Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks
alpha-beta-CROWN: An Efficient, Scalable and GPU Accelerated Neural Network Verifier (winner of VNN-COMP 2021, 2022, 2023, 2024, 2025)
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
thisiscam / math-with-slack
Forked from fsavje/math-with-slackRendered math (MathJax) with Slack's desktop client