chongyi-zheng

Starred repositories

Flax is a neural network library for JAX that is designed for flexibility.

Jupyter Notebook 7,008 778 Updated Jan 8, 2026

Mastering Diverse Domains through World Models

Python 2,642 436 Updated Sep 23, 2025

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

Python 1,616 163 Updated Sep 3, 2025

Benchmarking the Spectrum of Agent Capabilities

Python 503 88 Updated Jan 23, 2024

Evaluating long-term memory of reinforcement learning algorithms

Python 160 17 Updated Jun 23, 2023

Decoupled Q-Chunking

Python 50 3 Updated Dec 12, 2025

(Crafter + NetHack) in JAX. ICML 2024 Spotlight.

Python 362 42 Updated Jul 7, 2025

A framework for creating rich, 3D, Minecraft-like single and multi-agent environments for AI research. (Accepted at ICML 2025).

C++ 149 12 Updated Nov 28, 2025

Code for "Transitive RL: Value Learning via Divide and Conquer"

Python 45 3 Updated Oct 31, 2025

The first behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.

Python 705 69 Updated Jun 10, 2025

Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"

Python 26 2 Updated Oct 14, 2025

Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.

Python 356 54 Updated Jun 23, 2025

JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)

Python 40 7 Updated Jun 6, 2024

Python 339 42 Updated Nov 26, 2025

Python 133 8 Updated Dec 9, 2025

Python 655 51 Updated Apr 12, 2025

Python 250 22 Updated Apr 18, 2024

Python 122 5 Updated Jun 11, 2025

[EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Python 89 5 Updated Jun 10, 2025

Code for A Multi-Region Brain Model to Elucidate the Role of Hippocampus in Spatially Embedded Decision-Making (ICML 2025)

Python 4 Updated Jun 17, 2025

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,301 108 Updated Dec 15, 2025

The official implementation of "Horizon Reduction Makes RL Scalable"

Python 179 12 Updated Aug 2, 2025

Python 75 4 Updated May 31, 2025

Python 409 44 Updated Oct 12, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python 50,486 4,171 Updated Jan 8, 2026

Extreme Q-Learning: Max Entropy RL without Entropy

Python 87 11 Updated Feb 14, 2023

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Python 7,394 1,022 Updated Jul 3, 2024

Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"

Python 57 2 Updated Mar 26, 2024

Foundation Policies with Hilbert Representations (ICML 2024)

Python 104 9 Updated Sep 29, 2025