Skip to content
View zoeyuchao's full-sized avatar
🐱
Focusing
🐱
Focusing
  • Tsinghua University
  • Haidian, Beijing

Organizations

@efc-robot @marlbenchmark @staghuntrpg

Block or report zoeyuchao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 12 Updated Oct 6, 2025

A collection of paper/projects that trains flow matching model/policies via RL.

297 9 Updated Oct 9, 2025
Python 27 5 Updated Oct 1, 2025

RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.

Python 1,263 118 Updated Nov 16, 2025
Python 197 17 Updated Aug 25, 2025

This repository summarizes recent advances in the VLA + RL paradigm and provides a taxonomic classification of relevant works.

327 4 Updated Oct 10, 2025

Parse LaTeX math expressions

Python 3 Updated Aug 19, 2025

A lightweight LLM evaluation toolkit for RLinf. Support mathematical reasoning and long CoT tasks.

Python 5 Updated Sep 17, 2025

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments

Python 18 Updated Sep 30, 2025

Multi-UAV Pursuit-Evasion with Online Planning in Unknown Environments by Deep Reinforcement Learning

Python 117 12 Updated May 15, 2025

What Matters in Learning A Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study

Python 69 5 Updated Jun 11, 2025

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Python 1,524 147 Updated Sep 28, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,614 2,211 Updated Mar 11, 2025

Code for paper, "A Comparison of Imitation Learning Algorithms for Bimanual Manipulation" (Drolet et al., 2024)

Python 111 5 Updated Mar 13, 2025
C++ 35 4 Updated Apr 8, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,781 231 Updated Aug 11, 2024

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 156,052 13,655 Updated Nov 16, 2025

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,760 350 Updated Jul 18, 2024

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 69,678 8,396 Updated Sep 20, 2025
Python 41 7 Updated Sep 9, 2024

This is a repository for Hidden-utility Self-Play.

JavaScript 26 1 Updated Jul 27, 2023

Code for "On the Utility of Learning about Humans for Human-AI Coordination"

Python 109 45 Updated Apr 17, 2023

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

Python 915 142 Updated Dec 20, 2023

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 15,429 2,194 Updated Jul 24, 2024

Repository Containing the Code associated with the Paper: "Learning High-Speed Flight in the Wild"

C++ 734 185 Updated Jan 23, 2023

aw_nas: A Modularized and Extensible NAS Framework

Python 251 33 Updated Oct 3, 2023

SLAM algorithms and systems based on Neural Networks.

117 15 Updated Mar 13, 2020

Leveraging system development and robot deployment for ground-based autonomous navigation and exploration.

C++ 851 211 Updated Jul 29, 2024

MineRL Competition for Sample Efficient Reinforcement Learning - Python Package

Java 877 166 Updated Jan 22, 2025
Next