Skip to content
View clorisqiu1's full-sized avatar

Block or report clorisqiu1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Multi-agent reinforcement learning for traffic assignment

Python 2 1 Updated Nov 22, 2025

A collection of Gymnasium environments for engineering optimization problems, designed for reinforcement learning (RL) research.

DIGITAL Command Language 1 Updated Nov 13, 2025

😼 优雅地使用基于 clash/mihomo 的代理环境

Shell 6,281 804 Updated Nov 27, 2025

📚 从零开始的大语言模型原理与实践教程

Jupyter Notebook 21,944 1,960 Updated Nov 18, 2025

A curated reading list of research in Mixture-of-Experts(MoE).

651 44 Updated Oct 30, 2024

A collection of AWESOME things about mixture-of-experts

1,231 81 Updated Dec 8, 2024

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 24,540 3,447 Updated Nov 27, 2025

A simple and trans-platform agent framework and tutorial

Jupyter Notebook 194 41 Updated Nov 9, 2025

A simple and trans-platform rag framework and tutorial

Jupyter Notebook 225 25 Updated Sep 13, 2025

Prioritized Experience Replay (PER) implementation in PyTorch

Python 355 76 Updated Feb 3, 2020

A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)

Python 679 127 Updated Jun 5, 2018

Simplifying reinforcement learning for complex game environments

C 4,318 323 Updated Nov 26, 2025
Jupyter Notebook 4 2 Updated Jul 12, 2023

Large Language Models and Robotics.

21 Updated Apr 27, 2024
Python 4 5 Updated Mar 13, 2025

wow-fullstack,令人惊叹的全栈开发教程

TypeScript 219 44 Updated Oct 9, 2025

Reproduce results of the research article "Deep Reinforcement Learning Based Resource Allocation for V2V Communications"

Python 254 57 Updated Mar 17, 2022

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Python 1,906 520 Updated Apr 1, 2024

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 64,559 6,542 Updated Nov 11, 2025

Code from the Deep Reinforcement Learning in Action book from Manning, Inc

Jupyter Notebook 820 345 Updated Apr 22, 2024

Paper list of multi-agent reinforcement learning (MARL)

4,605 764 Updated Nov 19, 2025

经济学人(含音频)、纽约客、卫报、连线、大西洋月刊等英语杂志免费下载,支持epub、mobi、pdf格式, 每周更新

CSS 26,842 2,147 Updated Nov 21, 2025
Python 66 27 Updated Dec 27, 2023
Jupyter Notebook 17 4 Updated Dec 20, 2023

计算机自学指南

HTML 69,556 7,741 Updated Nov 14, 2025

https://hrl.boyuai.com/

Jupyter Notebook 4,229 769 Updated Nov 22, 2022

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 21,737 6,169 Updated Jul 13, 2023

Lightweight version of MAPPO to help you quickly migrate to your local environment.

Python 764 103 Updated Oct 23, 2025

Experiments of the three PPO-Algorithms (PPO, clipped PPO, PPO with KL-penalty) proposed by John Schulman et al. on the 'Cartpole-v1' environment.

Python 13 1 Updated Nov 14, 2021

rl-papers

49 9 Updated Mar 17, 2023
Next