Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 21,626 6,161 Updated Jul 13, 2023

apachecn / ailearning

AiLearning: Data Analysis + Machine Learning in Practice + Linear Algebra + PyTorch + NLTK + TF2

Python 41,547 11,593 Updated Nov 12, 2024

chihming / competitive-recsys

A collection of resources for Recommender Systems (RecSys)

536 115 Updated Dec 13, 2021

mitmath / 1806

18.06 course at MIT

Jupyter Notebook 2,897 744 Updated May 12, 2025

bannedbook / fanqiang

Bypassing the firewall: open internet access

Kotlin 40,656 7,393 Updated Oct 6, 2025

mengf1 / DHER

DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)

Python 66 7 Updated Nov 8, 2019

jiangyiqun233 / PRML_learning

Learning formulas

Jupyter Notebook 293 60 Updated Jul 24, 2021

remoteintech / remote-jobs

A list of semi to fully remote-friendly companies (jobs) in tech.

JavaScript 39,281 3,848 Updated Oct 12, 2025

NeuroCSUT / DeepMind-Atari-Deep-Q-Learner-2Player

Forked from DorianKodelja/DeepMind-Atari-Deep-Q-Learner-2Player

Multiagent Cooperation and Competition with Deep Reinforcement Learning

Lua 123 35 Updated Nov 26, 2015

sisl / MADRL

Repo containing code for multi-agent deep reinforcement learning (MADRL).

Python 720 124 Updated Apr 12, 2023

yexm xuemei-ye

Stars

MatthewLQM / Paper-Of-Compute-Advertising

wnzhang / rtb-papers

QMMMS / QMMMS.github.io

wangshusen / DRL

openreasoner / openr

autogluon / autogluon

hydecorp / hydejack-starter-kit

boyu-ai / Hands-on-RL

Future-House / paper-qa

opendilab / awesome-decision-transformer

brianmaierjr / long-haul

cotes2020 / chirpy-starter

OuYaMing / Image-classification-and-target-detection-by-pytorch

lang-du / fruit_detection

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

HqWu-HITCS / Awesome-Chinese-LLM

blcuicall / taoli

wangshusen / RecommenderSystem

Kulbear / deep-learning-coursera

bumingbaipod / podcast

dennybritz / reinforcement-learning