yogesh1q2w

Yogesh Tripathi yogesh1q2w

Ongoing MSc in AI and ML at TU Darmstadt || Working on Deep RL at IAS lab (Dr. Jan Peters' group) || Previously B.Tech in CSE at IIT Madras

3 followers · 6 following

Darmstadt, Germany
https://github.com/yogesh1q2w
in/yogesh1q2w
https://scholar.google.com/citations?user=NRvRRmYAAAAJ&hl=en
@yogesh1q2w.bsky.social
@YogeshTrip7354

Stars

slimRL / slimStreamQ

Python · 3 stars · Updated Nov 16, 2025

kenjyoung / MinAtar

Python · 320 stars · 66 forks · Updated Dec 19, 2024

Stable-Baselines-Team / stable-baselines

Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Python · 305 stars · 63 forks · Updated Apr 29, 2023

slimRL / slimCQL

Python · 8 stars · Updated Nov 14, 2025

slimRL / slimSAC

Python · 6 stars · Updated Nov 11, 2025

slimRL / slimBBF

Python · 11 stars · Updated Nov 4, 2025

DLR-RM / stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python · 12,175 stars · 2,004 forks · Updated Nov 14, 2025

slimRL / slimDQN

Simple and concise implementation of DQN on toy environments.

Python · 10 stars · Updated Nov 14, 2025

kennethleungty / Neural-Network-Architecture-Diagrams

Diagrams for visualizing neural network architecture

999 stars · 514 forks · Updated Apr 12, 2025

theovincent / EauDeDQN

🌸 Eau De Q-Network [RLC 25] is a pruning algorithm designed specifically for RL that discovers the final sparsity level of the networks 🌸

Python · 6 stars · Updated Feb 27, 2025

theovincent / i-DQN

✨ iterated Q-Network [TMLR 25] learns several Bellman iterations in parallel instead of sequentially ✨

Python · 14 stars · Updated Mar 29, 2025

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python · 153,132 stars · 31,252 forks · Updated Nov 28, 2025

yogesh1q2w / Dynamic-Graph-Algorithms

C++ · 16 stars · 1 fork · Updated Mar 26, 2020

google / dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook · 10,820 stars · 1,391 forks · Updated Nov 4, 2024

instadeepai / flashbax

⚡ Flashbax: Accelerated Replay Buffers in JAX

Python · 263 stars · 21 forks · Updated Sep 22, 2025

robfiras / loco-mujoco

Imitation learning benchmark focusing on complex locomotion tasks using MuJoCo.

Python · 1,272 stars · 134 forks · Updated May 30, 2025
