Stars
Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs
Simplifying reinforcement learning for complex game environments
A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.
A JAX research toolkit for building, editing, and visualizing neural networks.
Foundation Policies with Hilbert Representations (ICML 2024)
This repostory contains code and data instructions for ROAM, 3DV 2024. Authors: Wanyue Zhang, Rishabh Dabral, Thomas Leimkühler, Vladislav Golyanik†, Marc Habermann†, Christian Theobalt.
Simulating SMPL humanoid, supporting PHC/PHC-MJX/PULSE/SimXR code bases.
A library for generative social simulation
This repo is the implementation of paper ''SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning''.
Library and command-line utility for rendering projects templates.
Minimal, clean, single-file implementations of common robotics controllers in MuJoCo.
A JAX-based simulator for autonomous driving research.
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
Standalone library of frequently-used wrappers for dm_env environments.
A Text Based Copy of Slay The Spire entirely played in the shell.
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code
(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling
A faster pytorch implementation of faster r-cnn
EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax
This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning.
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL