
Organizations

@rll @rllab

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).

Python 8,933 838 Updated Nov 11, 2025
Python 822 185 Updated Mar 24, 2023

A job launching library for docker, EC2, GCP, etc.

Python 57 39 Updated Aug 10, 2021

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

Python 6,303 1,795 Updated Aug 6, 2023

Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples

Jupyter Notebook 904 172 Updated Jun 10, 2023

StarCraft II Learning Environment

Python 8,208 1,172 Updated Jul 23, 2024

µniverse: RL environments for HTML5 games

JavaScript 366 22 Updated Jan 3, 2019

This is my implementation of Optimality Tightening

Python 37 8 Updated Apr 26, 2017

Implementation of Policy Gradient algorithms in PyTorch. (Sequential, Distributed sync + async)

Python 9 2 Updated Nov 7, 2017

A customisable 3D platform for agent-based AI research

C 7,277 1,391 Updated Jan 4, 2023

A starter agent that can solve a number of universe environments.

Python 1,102 315 Updated Apr 7, 2018

Universe: a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications.

Python 7,514 958 Updated Apr 5, 2018

Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258)

Python 172 23 Updated Nov 3, 2016

Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"

Python 350 91 Updated Nov 22, 2018

Code for reproducing key results in the paper "InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets"

Python 1,072 304 Updated Mar 25, 2021

Fast Recurrent Networks Library

C++ 577 88 Updated Sep 20, 2016
Python 291 99 Updated Mar 13, 2018

M-LOOP: Machine-learning online optimization package

Python 169 58 Updated Sep 16, 2025

A toolkit for developing and comparing reinforcement learning algorithms.

Python 36,759 8,710 Updated Oct 11, 2024

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Python 3,019 802 Updated Jun 10, 2023

PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.

Python 1,054 232 Updated Jan 19, 2022

Variational and semi-supervised neural network toppings for Lasagne

Python 210 31 Updated Aug 25, 2016

Guided Policy Search

Python 602 245 Updated Feb 9, 2021

Neural Turing Machines library in Theano with Lasagne

Python 301 51 Updated Jul 31, 2018

Code for the Neural GPU

49 1 Updated Mar 15, 2016

Common interface for Theano, CGT, and TensorFlow

Python 238 18 Updated Apr 23, 2016

A curated list of deep learning resources for computer vision

11,081 2,784 Updated Aug 15, 2023
JavaScript 136 22 Updated Jun 2, 2024

Model Zoo for Deep Reinforcement Learning

14 2 Updated Dec 19, 2015

2D Game Physics for Python

Python 506 91 Updated Nov 29, 2024