Lists (1)
Sort Name ascending (A-Z)
Stars
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.
日本語LLMまとめ - Overview of Japanese LLMs
simple JAX-/NumPy-based implementations of NGD with exact/approximate Fisher Information Matrix both in parameter-space and function-space (by empirical/analytical NTK).
On the Theoretical Limitations of Embedding-Based Retrieval
Really Fast End-to-End Jax RL Implementations
⚡ Flashbax: Accelerated Replay Buffers in JAX
Agar.io for Continual Reinforcement Learning
repository to research & share the machine learning articles
🪐 Markdown with superpowers: from ideas to papers, presentations, websites, books, and knowledge bases.
This repository is a curated collection of information (keywords, papers, libraries, books, etc.) about counterfactual explanations🙃 Contributions are welcome! Our maintenance capacity is limited, …
This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.
Deep reinforcement learning without experience replay, target networks, or batch updates.
A repository for collating all the resources such as articles, blogs, papers, and books related to Bayesian Statistics.
Code for ICML 2018 paper on "Fast and Scalable Bayesian Deep Learning by Weight-Perturbation in Adam" by Khan, Nielsen, Tangkaratt, Lin, Gal, and Srivastava
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
Simple Distributed Reinforcement Learning Framework(シンプルな分散強化学習フレームワーク)
[ICLR 2025] AdaFisher: Adaptive Second Order Optimization via Fisher Information
Modularized Implementation of Deep RL Algorithms in PyTorch
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
A fast and robust algorithm for temporal difference learning
An Extendible (General) Continual Learning Framework based on Pytorch - official codebase of Dark Experience for General Continual Learning
[ICLR 24 Oral/Outstanding Paper Honorable Mention Award 🎉]
PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.
Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation