Skip to content
View marisgg's full-sized avatar

Block or report marisgg

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
C++ 3 Updated Nov 18, 2025

CC-POMCP, "Monte-Carlo Tree Search for Constrained POMDPs (NIPS 2018)"

C++ 27 2 Updated Sep 30, 2018

Efficiently computes derivatives of NumPy code.

Python 7,398 929 Updated Nov 18, 2025

A JAX based compiler to make fast vectorized environment out of Storm models.

Python 2 1 Updated Nov 5, 2024

Storm for almost everyone

JavaScript 12 4 Updated Nov 19, 2025

Create and revise bibtex entries from DBLP

Python 23 8 Updated Sep 8, 2025

bash scripts for the ELO of RU

Shell 1 Updated Oct 7, 2020

Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"

Python 25 2 Updated May 5, 2024

PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and RL

Python 794 153 Updated Nov 6, 2025

Sparse n-dimensional arrays in Python

12 6 Updated Feb 10, 2010

Python and Julia code for interfacing with X-Plane through UDP; similarly to XPlaneConnect, but also works for X-Plane 12.

Julia 20 Updated Feb 28, 2025
Java 2 Updated Jan 10, 2025

Models of Sequential Decision-Making

Python 52 6 Updated Jan 10, 2025

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,576 377 Updated Jun 2, 2025

RPM sources for the DisplayLink USB display adapters

Makefile 706 91 Updated Nov 17, 2025

fast + parallel AlphaZero in JAX

Python 106 11 Updated Dec 22, 2024

Logically-Constrained Reinforcement Learning

Python 54 14 Updated Jul 5, 2024

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …

C# 18,867 4,392 Updated Nov 14, 2025

A clean and robust Pytorch implementation of PPO on Discrete action space

Python 71 10 Updated Jun 8, 2024
C++ 23 23 Updated Nov 19, 2025

Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)

Python 90 11 Updated Nov 21, 2023

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Python 338 47 Updated Aug 22, 2024

Implementation of the Dec-MCTS algorithm for multi-robot planning. Project for L32 Advanced Robotics course

Python 5 Updated Apr 28, 2022

Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks

Python 51 13 Updated Dec 8, 2022
Python 318 66 Updated Dec 19, 2024

alpha-beta-CROWN: An Efficient, Scalable and GPU Accelerated Neural Network Verifier (winner of VNN-COMP 2021, 2022, 2023, 2024, 2025)

Python 326 84 Updated Jan 31, 2025

Library for Model Based RL

Python 1,031 170 Updated Jul 12, 2024

Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

Python 166 22 Updated May 9, 2023

Rendered math (MathJax) with Slack's desktop client

Python 315 29 Updated Feb 9, 2023
Python 23 Updated Apr 16, 2024
Next