dementrock

Rocky Duan dementrock

308 followers · 56 following

http://www.rockyduan.com

Achievements

Organizations

Stars

skypilot-org / skypilot

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).

Python 8,933 838 Updated Nov 11, 2025

eg4000 / SKU110K_CVPR19

Python 822 185 Updated Mar 24, 2023

justinjfu / doodad

A job launching library for docker, EC2, GCP, etc.

Python 57 39 Updated Aug 10, 2021

tensorpack / tensorpack

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

Python 6,303 1,795 Updated Aug 6, 2023

anishathalye / obfuscated-gradients

Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples

Jupyter Notebook 904 172 Updated Jun 10, 2023

google-deepmind / pysc2

StarCraft II Learning Environment

Python 8,208 1,172 Updated Jul 23, 2024

unixpickle / muniverse

µniverse: RL environments for HTML5 games

JavaScript 366 22 Updated Jan 3, 2019

ShibiHe / Q-Optimality-Tightening

This is my implementation of the Optimality Tightening

Python 37 8 Updated Apr 26, 2017

seba-1511 / drl.pth

Implementation of Policy Gradient algorithms in PyTorch. (Sequential, Distributed sync + async)

Python 9 2 Updated Nov 7, 2017

google-deepmind / lab

A customisable 3D platform for agent-based AI research

C 7,277 1,391 Updated Jan 4, 2023

openai / universe-starter-agent

A starter agent that can solve a number of universe environments.

Python 1,102 315 Updated Apr 7, 2018

openai / universe

Universe: a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications.

Python 7,514 958 Updated Apr 5, 2018

jiamings / fast-weights

Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258)

Python 172 23 Updated Nov 3, 2016

openai / vime

Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"

Python 350 91 Updated Nov 22, 2018

openai / InfoGAN

Code for reproducing key results in the paper "InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets"

Python 1,072 304 Updated Mar 25, 2021

baidu-research / persistent-rnn

Fast Recurrent Networks Library

C++ 577 88 Updated Sep 20, 2016

jych / nips2015_vrnn

Python 291 99 Updated Mar 13, 2018

michaelhush / M-LOOP

M-LOOP: Machine-learning online optimization package

Python 169 58 Updated Sep 16, 2025

openai / gym

A toolkit for developing and comparing reinforcement learning algorithms.

Python 36,759 8,710 Updated Oct 11, 2024

rll / rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Python 3,019 802 Updated Jun 10, 2023

ntasfi / PyGame-Learning-Environment

PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.

Python 1,054 232 Updated Jan 19, 2022

casperkaae / parmesan

Variational and semi-supervised neural network toppings for Lasagne

Python 210 31 Updated Aug 25, 2016

cbfinn / gps

Guided Policy Search

Python 602 245 Updated Feb 9, 2021

snipsco / ntm-lasagne

Neural Turing Machines library in Theano with Lasagne

Python 301 51 Updated Jul 31, 2018

lukaszkaiser / NeuralGPU

Code for the Neural GPU

49 1 Updated Mar 15, 2016

dementrock / tensorfuse

Common interface for Theano, CGT, and TensorFlow

Python 238 18 Updated Apr 23, 2016

kjw0612 / awesome-deep-vision

A curated list of deep learning resources for computer vision

11,081 2,784 Updated Aug 15, 2023

janismac / ControlChallenges

JavaScript 136 22 Updated Jun 2, 2024

michalkoziarski / Deep-RL-Zoo

Model Zoo for Deep Reinforcement Learning

14 2 Updated Dec 19, 2015

pybox2d / pybox2d

2D Game Physics for Python

Python 506 91 Updated Nov 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rocky Duan dementrock

Achievements

Achievements

Organizations

Block or report dementrock

Stars

skypilot-org / skypilot

eg4000 / SKU110K_CVPR19

justinjfu / doodad

tensorpack / tensorpack

anishathalye / obfuscated-gradients

google-deepmind / pysc2

unixpickle / muniverse

ShibiHe / Q-Optimality-Tightening

seba-1511 / drl.pth

google-deepmind / lab

openai / universe-starter-agent

openai / universe

jiamings / fast-weights

openai / vime

openai / InfoGAN

baidu-research / persistent-rnn

jych / nips2015_vrnn

michaelhush / M-LOOP

openai / gym

rll / rllab

ntasfi / PyGame-Learning-Environment

casperkaae / parmesan

cbfinn / gps

snipsco / ntm-lasagne

lukaszkaiser / NeuralGPU

dementrock / tensorfuse

kjw0612 / awesome-deep-vision

janismac / ControlChallenges

michalkoziarski / Deep-RL-Zoo

pybox2d / pybox2d