🏠
Working from home

Organizations

@UCL-SML


Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML 707 104 Updated Nov 28, 2025

Nano vLLM

Python 9,348 1,152 Updated Nov 3, 2025

Simplifying reinforcement learning for complex game environments

C 4,340 324 Updated Nov 28, 2025

A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.

196,045 11,983 Updated Nov 19, 2024

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,831 68 Updated Jun 22, 2025

Foundation Policies with Hilbert Representations (ICML 2024)

Python 102 9 Updated Sep 29, 2025

This repository contains code and data instructions for ROAM, 3DV 2024. Authors: Wanyue Zhang, Rishabh Dabral, Thomas Leimkühler, Vladislav Golyanik†, Marc Habermann†, Christian Theobalt.

C++ 26 2 Updated Jun 21, 2024

Simulating SMPL humanoid, supporting PHC/PHC-MJX/PULSE/SimXR code bases.

Python 301 21 Updated Nov 3, 2025

A library for generative social simulation

Python 1,087 235 Updated Nov 28, 2025

This repo is the implementation of the paper "SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning".

Python 50 16 Updated Dec 4, 2023
Jupyter Notebook 18 1 Updated May 30, 2023

Library and command-line utility for rendering projects templates.

Python 2,954 234 Updated Nov 27, 2025

Minimal, clean, single-file implementations of common robotics controllers in MuJoCo.

Python 383 28 Updated Apr 16, 2024

LXM3: XManager launch backend for HPC clusters

Python 11 1 Updated May 11, 2024

A JAX-based simulator for autonomous driving research.

Python 993 122 Updated Oct 23, 2025

🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX

Python 60 5 Updated Oct 23, 2023

Corax: Core RL in JAX

Python 38 Updated Feb 22, 2024

Standalone library of frequently-used wrappers for dm_env environments.

Python 17 3 Updated Jul 9, 2024
Python 1 Updated Oct 11, 2023

A Text Based Copy of Slay The Spire entirely played in the shell.

Python 34 7 Updated Nov 17, 2025

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).

Python 9,026 859 Updated Nov 28, 2025

Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Python 836 97 Updated Apr 18, 2024

RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code

Python 702 192 Updated May 12, 2024

(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling

Python 129 12 Updated Oct 26, 2023

A unified framework for robot learning

Python 599 95 Updated Nov 26, 2024

A faster pytorch implementation of faster r-cnn

Python 7,850 2,323 Updated May 20, 2022

EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax

Python 129 14 Updated Jan 4, 2024

This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning.

Python 30 3 Updated Oct 26, 2022

CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL

Python 118 11 Updated Aug 22, 2024