Skip to content
View daniilpyatko's full-sized avatar

Highlights

  • Pro

Block or report daniilpyatko

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok et al. 2025).

27 2 Updated Nov 6, 2025

Enable AI to ask anyone anything 🤝

Python 67 5 Updated Jun 4, 2025

Codebase to fully reproduce the results of "No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO" (Moalla et al. 2024). Uses TorchRL and provides extensive tool…

Python 29 3 Updated Nov 20, 2024

A template for starting reproducible Python machine-learning projects with hardware acceleration. Find an example at https://github.com/CLAIRE-Labo/no-representation-no-trust

Shell 112 7 Updated Jun 6, 2025

Анализ переноса обучения в задаче определения тональности

Jupyter Notebook 3 Updated Dec 14, 2024

Sources of some algorithms and data structures

C++ 2 Updated Jun 18, 2024

Clean minimalist implementations of popular competitive programming algorithms

C++ 219 29 Updated Nov 17, 2020