Chao Yu zoeyuchao

🐱

Focusing

Assistant Professor at Tsinghua University, interested in MARL and Embodied AI.

529 followers · 54 following

Tsinghua University
Haidian, Beijing

Lists (3)

🔮 Future ideas

✨ Inspiration

🚀 My stack

Stars

thu-uav / JuggleRL_train

Python 12 Updated Oct 6, 2025

Tonghe-Zhang / Awesome-Flow-RL-Papers

A collection of papers and projects that train flow matching models/policies via RL.

297 9 Updated Oct 9, 2025

Elessar123 / SAC-FLOW

Python 27 5 Updated Oct 1, 2025

RLinf / RLinf

RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.

Python 1,263 118 Updated Nov 16, 2025

gen-robot / RL4VLA

Python 197 17 Updated Aug 25, 2025

OpenHelix-Team / Awesome-VLA-RL

This repository summarizes recent advances in the VLA + RL paradigm and provides a taxonomic classification of relevant works.

327 4 Updated Oct 10, 2025

RLinf / latex2sympy2

Forked from IuvenisSapiens/latex2sympy2

Parse LaTeX math expressions

Python 3 Updated Aug 19, 2025

RLinf / LLMEvalKit

Forked from QwenLM/Qwen2.5-Math

A lightweight LLM evaluation toolkit for RLinf. Supports mathematical reasoning and long CoT tasks.

Python 5 Updated Sep 17, 2025

zelaix / VS-Bench

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments

Python 18 Updated Sep 30, 2025

thu-uav / Multi-UAV-pursuit-evasion

Multi-UAV Pursuit-Evasion with Online Planning in Unknown Environments by Deep Reinforcement Learning

Python 117 12 Updated May 15, 2025

thu-uav / SimpleFlight

What Matters in Learning A Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study

Python 69 5 Updated Jun 11, 2025

thu-ml / RoboticsDiffusionTransformer

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Python 1,524 147 Updated Sep 28, 2025

openai / swarm

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,614 2,211 Updated Mar 11, 2025

ir-lab / bimanual-imitation

Code for paper, "A Comparison of Imitation Learning Algorithms for Bimanual Manipulation" (Drolet et al., 2024)

Python 111 5 Updated Mar 13, 2025

thu-uav / FlightBench

C++ 35 4 Updated Apr 8, 2025

eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python 2,781 231 Updated Aug 11, 2024

ollama / ollama

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 156,052 13,655 Updated Nov 16, 2025

marlbenchmark / on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,760 350 Updated Jul 18, 2024

HosnLS / Hierarchical-Language-Agent

Python 37 8 Updated Jan 9, 2024

binary-husky / gpt_academic

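Provides a practical interactive interface for GPT/GLM and other LLMs, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other languages; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen (通义千问), deepseekcoder, iFlytek Spark (讯飞星火), ERNIE Bot (文心一言), llama2, rwkv, claude2, m…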

Python 69,678 8,396 Updated Sep 20, 2025