Skip to content
View colin-fox's full-sized avatar
  • Ncepuer
  • Beijing, China

Block or report colin-fox

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Single file implementations of Deep Multi-agent Reinforcement Learning

Python 35 7 Updated Oct 13, 2025

LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.

Python 638 27 Updated Aug 22, 2025

🚀 A fast safe reinforcement learning library in PyTorch

Python 217 32 Updated Sep 30, 2024

Source code for the papers: RL for Mitigating Cascading Failures: Targeted Exploration via Sensitivity Factors (NeurIPS) / Blackout Mitigation via Physics-guided RL (IEEE TPS). Built with TensorFlow.

Python 2 Updated Jul 4, 2025

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

Python 3,000 371 Updated Jun 11, 2025

RL2Grid is a standardized benchmark for reinforcement learning (RL) agents in realistic power grid environments. Built on top of Grid2Op, it models real-time operations such as topology optimizatio…

Python 33 6 Updated Sep 4, 2025

A framework to design Reinforcement Learning environments that model Active Network Management (ANM) tasks in electricity distribution networks.

Python 164 36 Updated Nov 14, 2024

This repository contains an implementation of a UAV-based task offloading model using the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) reinforcement learning algorithm.

Python 129 11 Updated Jun 15, 2025

The Flatland Framework is a multi-purpose environment to tackle problems around resilient resource allocation under uncertainty. It is designed to be a flexible and method agnostic to solve a wide …

Jupyter Notebook 47 14 Updated Oct 16, 2025

[IEEE TAI] Safe Multi-Agent Reinforcement Learning to Make decisions in Autonomous Driving.

Jupyter Notebook 77 11 Updated Apr 27, 2025

A high-capacity ride-sharing simulator calibrated by real request datasets and road netwoeks

Python 17 6 Updated May 2, 2023

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 853 115 Updated Mar 23, 2024

A Fair and Scalable Time Series Forecasting Benchmark and Toolkit.

Python 1,451 181 Updated Oct 17, 2025

PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm

Python 54 7 Updated Jul 11, 2022

A clean and robust Pytorch implementation of SAC on discrete action space

Python 41 7 Updated Oct 23, 2024

Official code repo for the MARL book (www.marl-book.com)

Python 555 89 Updated Mar 30, 2025

Meta-RL Model-Based Algorithm

Python 40 3 Updated Apr 30, 2025

🙃 A delightful community-driven (with 2,400+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…

Shell 182,105 26,255 Updated Oct 15, 2025

Explanation to key concepts in ML

8,105 660 Updated Jun 30, 2025

[TNNLS-2024, arXiv-2023.2.10] Official repository of "A Survey on Causal Reinforcement Learning"

52 3 Updated Aug 7, 2025

Official implementation for the NeurIPS 2023 paper: "Reduced Policy Optimization for Continuous Control with Hard Constraints"

Python 42 5 Updated Apr 1, 2024

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Jupyter Notebook 3,051 277 Updated May 3, 2024
Jupyter Notebook 1 Updated Oct 15, 2025

A collection of useful .gitignore templates

170,131 83,008 Updated Sep 10, 2025

Bonus materials, exercises, and example projects for our Python tutorials

HTML 5,010 5,326 Updated Oct 15, 2025

Deep Q-Network (DQN) and Fitted Q-Iteration (FQI) tutorial for RL Summer School 2023

Jupyter Notebook 83 12 Updated Oct 1, 2025

Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning

Python 26 1 Updated Sep 13, 2023

Code implementation for the NeurIPS 2022 paper "Policy Optimization with Advantage Regularization for Long-Term Fairness in Decision Systems".

Python 8 6 Updated Apr 16, 2023
Python 316 80 Updated Mar 8, 2023

Transform any arXiv papers into slides using LLMs

Python 35 4 Updated Sep 4, 2025
Next