Skip to content
View yogesh1q2w's full-sized avatar

Block or report yogesh1q2w

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 3 Updated Nov 16, 2025
Python 320 66 Updated Dec 19, 2024

Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Python 305 63 Updated Apr 29, 2023
Python 8 Updated Nov 14, 2025
Python 6 Updated Nov 11, 2025
Python 11 Updated Nov 4, 2025

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 12,175 2,004 Updated Nov 14, 2025

Simple and concise implementation of DQN on toy environments.

Python 10 Updated Nov 14, 2025

Diagrams for visualizing neural network architecture

999 514 Updated Apr 12, 2025

🌸Eau De Q-Network [RLC 25] is a pruning algorithm specifically designed for RL which discovers the final sparsity level of the networks🌸

Python 6 Updated Feb 27, 2025

✨iterated Q-Network [TMLR 25] learns several Bellman iterations in parallel instead of learning them sequentially✨

Python 14 Updated Mar 29, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 153,132 31,252 Updated Nov 28, 2025

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook 10,820 1,391 Updated Nov 4, 2024

⚡ Flashbax: Accelerated Replay Buffers in JAX

Python 263 21 Updated Sep 22, 2025

Imitation learning benchmark focusing on complex locomotion tasks using MuJoCo.

Python 1,272 134 Updated May 30, 2025