
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.

Python 18,256 855 Updated Jan 7, 2026

Japanese LLM Summary - Overview of Japanese LLMs

TypeScript 1,328 40 Updated Jan 6, 2026

Simple JAX- and NumPy-based implementations of natural gradient descent (NGD) with an exact or approximate Fisher information matrix, in both parameter space and function space (via the empirical or analytical NTK).

Python 15 3 Updated Oct 21, 2020

On the Theoretical Limitations of Embedding-Based Retrieval

Jupyter Notebook 618 47 Updated Sep 15, 2025

Really Fast End-to-End Jax RL Implementations

Python 1,009 82 Updated Sep 9, 2024

⚡ Flashbax: Accelerated Replay Buffers in JAX

Python 268 21 Updated Sep 22, 2025

Agar.io for Continual Reinforcement Learning

C++ 24 1 Updated Jul 24, 2025

Repository for researching and sharing machine learning articles

3,903 200 Updated Jul 1, 2022

🪐 Markdown with superpowers: from ideas to papers, presentations, websites, books, and knowledge bases.

Kotlin 9,794 239 Updated Jan 7, 2026

This repository is a curated collection of information (keywords, papers, libraries, books, etc.) about counterfactual explanations🙃 Contributions are welcome! Our maintenance capacity is limited, …

23 Updated Oct 27, 2022

This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.

Python 105 5 Updated Jul 1, 2024

Deep reinforcement learning without experience replay, target networks, or batch updates.

Python 272 32 Updated Mar 18, 2025

A repository for collating all the resources such as articles, blogs, papers, and books related to Bayesian Statistics.

118 26 Updated Nov 2, 2021

TeX 5 Updated Feb 20, 2025

Code for ICML 2018 paper on "Fast and Scalable Bayesian Deep Learning by Weight-Perturbation in Adam" by Khan, Nielsen, Tangkaratt, Lin, Gal, and Srivastava

MATLAB 112 22 Updated Dec 19, 2018

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 8,762 952 Updated Jul 8, 2025

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 11,911 1,738 Updated Dec 19, 2025

Simple Distributed Reinforcement Learning Framework

Python 58 6 Updated Nov 15, 2025

[ICLR 2025] AdaFisher: Adaptive Second Order Optimization via Fisher Information

Python 49 4 Updated Feb 7, 2025

Modularized Implementation of Deep RL Algorithms in PyTorch

Python 3,392 696 Updated Apr 16, 2024

Explorer is a PyTorch reinforcement learning framework for exploring new ideas.

Python 97 14 Updated Jun 19, 2025

A fast and robust algorithm for temporal difference learning

C++ 22 3 Updated Dec 2, 2025

An Extendible (General) Continual Learning Framework based on PyTorch - official codebase of Dark Experience for General Continual Learning

Python 767 145 Updated Nov 22, 2025

[ICLR 24 Oral/Outstanding Paper Honorable Mention Award 🎉]

Python 39 3 Updated Apr 21, 2024

Awesome Incremental Learning

4,356 623 Updated Jan 7, 2026

A hyperparameter optimization framework

Python 13,326 1,227 Updated Jan 7, 2026

maximal update parametrization (µP)

Jupyter Notebook 1,653 103 Updated Jul 17, 2024

PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.

Jupyter Notebook 1,809 344 Updated Nov 5, 2025

Constrained optimization toolkit for PyTorch

Python 706 35 Updated Jul 29, 2025

Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation

Python 360 80 Updated Sep 2, 2025