Sun Yat-Sen University
- Guangzhou, China
- UTC +08:00
- https://guowei-zou.github.io/
Highlights
- Pro
Stars
Ctrl-World: A Controllable Generative World Model for Robot Manipulation
RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.
[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
Implementation of Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.
A curated list of large VLM-based VLA models for robotic manipulation.
Easily turn any webpage or Vue/React project into a lightweight (under 5M) multi-platform desktop and mobile app in just a few minutes. https://ppofficial.netlify.app/
Easily turn any webpage or Vue/React project into a lightweight (under 5M) multi-platform desktop and mobile app in just a few minutes. https://ppofficial.netlify.app
Simplifying diffusion/flow policies by treating action trajectories as flow trajectories
Project page of the paper "Learning Multi-Scale Photo Exposure Correction" (CVPR 2021).
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Unofficial implementation of Split Mean Flow from ByteDance
CVPR 2025: Frequency Dynamic Convolution for Dense Image Prediction
A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.
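Deploy it yourself with your own configuration parameters, or use the edu temporary mailbox I have already deployed. The frontend runs on unicloud and the backend on workers. No forwarding-mailbox configuration is required, and you can deploy it to the web, mini programs, and apps.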
[AAAI 2026] D²PPO: Diffusion Policy Policy Optimization with Dispersive Loss.
Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.