Yicheng Luo ethanluoyc

🏠

Working from home

281 followers · 172 following

University College London
London, United Kingdom
luoyicheng.net
@LuoYicheng

Achievements

x3 x2
Stars

jax-ml / scaling-book

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML 707 104 Updated Nov 28, 2025

GeeeekExplorer / nano-vllm

Nano vLLM

Python 9,348 1,152 Updated Nov 3, 2025

PufferAI / PufferLib

Simplifying reinforcement learning for complex game environments

C 4,340 324 Updated Nov 28, 2025

trimstray / the-book-of-secret-knowledge

A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.

196,045 11,983 Updated Nov 19, 2024

google-deepmind / penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,831 68 Updated Jun 22, 2025

seohongpark / HILP

Foundation Policies with Hilbert Representations (ICML 2024)

Python 102 9 Updated Sep 29, 2025

RosettaWYzhang / Roam

This repository contains code and data instructions for ROAM, 3DV 2024. Authors: Wanyue Zhang, Rishabh Dabral, Thomas Leimkühler, Vladislav Golyanik†, Marc Habermann†, Christian Theobalt.

C++ 26 2 Updated Jun 21, 2024

ZhengyiLuo / SMPLSim

Simulating SMPL humanoid, supporting PHC/PHC-MJX/PULSE/SimXR code bases.

Python 301 21 Updated Nov 3, 2025

google-deepmind / alphageometry

Python 4,705 550 Updated Jun 19, 2025

google-deepmind / concordia

A library for generative social simulation

Python 1,087 235 Updated Nov 28, 2025

hsvgbkhgbv / shapley-q-learning

This repo is the implementation of the paper "SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning".

Python 50 16 Updated Dec 4, 2023

davidbrandfonbrener / imitation_pretraining

Jupyter Notebook 18 1 Updated May 30, 2023

copier-org / copier

Library and command-line utility for rendering project templates.

Python 2,954 234 Updated Nov 27, 2025

kevinzakka / mjctrl

Minimal, clean, single-file implementations of common robotics controllers in MuJoCo.

Python 383 28 Updated Apr 16, 2024

ethanluoyc / lxm3

LXM3: XManager launch backend for HPC clusters

Python 11 1 Updated May 11, 2024

waymo-research / waymax

A JAX-based simulator for autonomous driving research.

Python 993 122 Updated Oct 23, 2025

instadeepai / sebulba

🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX

Python 60 5 Updated Oct 23, 2023

ethanluoyc / corax

Corax: Core RL in JAX

Python 38 Updated Feb 22, 2024

kevinzakka / dm_env_wrappers

Standalone library of frequently-used wrappers for dm_env environments.

Python 17 3 Updated Jul 9, 2024

dtch1997 / d4rl-slim-benchmark

Python 1 Updated Oct 11, 2023

Difio3333 / slaythetext

A Text Based Copy of Slay The Spire entirely played in the shell.

Python 34 7 Updated Nov 17, 2025

skypilot-org / skypilot

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).

Python 9,026 859 Updated Nov 28, 2025

vimalabs / VIMA

Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Python 836 97 Updated Apr 18, 2024

lcswillems / rl-starter-files

RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code

Python 702 192 Updated May 12, 2024

waterhorse1 / ChessGPT

(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling

Python 129 12 Updated Oct 26, 2023

vikashplus / robohive

A unified framework for robot learning

Python 599 95 Updated Nov 26, 2024

jwyang / faster-rcnn.pytorch

A faster pytorch implementation of faster r-cnn

Python 7,850 2,323 Updated May 20, 2022

rwightman / efficientnet-jax

EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax

Python 129 14 Updated Jan 4, 2024

Asap7772 / PTR

This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning.

Python 30 3 Updated Oct 26, 2022

vwxyzjn / cleanba

CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL

Python 118 11 Updated Aug 22, 2024
