Stars
[CVPR 2022] Pre-Training 3D Point Cloud Transformers with Masked Point Modeling
[ECCV 2022] Masked Autoencoders for Point Cloud Self-supervised Learning
[ICML 2023] Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining
3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians (ACM MM 25)
Boost segmentation model mIoU/Dice instantly WITHOUT retraining. A plug-and-play, training-free optimization module. Published in NeurIPS & JMLR. Compatible with SAM, DeepLab, SegFormer, and more. 🧩
Official implementation of "Robo-Dopamine: General Process Reward Modeling for High-Precision Robotic Manipulation"
[CVPR 2025] GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency
Official implementation of Chain-of-Action: Trajectory Autoregressive Modeling for Robotic Manipulation. Accepted at NeurIPS 2025.
The official implementation of VITA, VITA15, LongVITA, VITA-Audio, VITA-VLA, and VITA-E.
Mastering Diverse Domains through World Models
RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI
[NAACL 2025] Self-Pluralising Culture Alignment for Large Language Models
U-Arm: Lerobot-Everything-Cross-Embodiment-Teleoperation
Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation"
This project provides a variety of code to help you collect data from your robotic arm. Have fun!
Official repo for AGNOSTOS, a cross-task manipulation benchmark, and X-ICM, a cross-task in-context manipulation (VLA) method.
Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.
🌴[CVPR 2024] OakInk2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion
[CVPR 2025 Oral] TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
Affordance-based Robot Manipulation with Flow Matching
TorchCFM: a Conditional Flow Matching library
[AAAI 2025 Oral] FlowPolicy: Enabling Fast and Robust 3D Flow-based Policy via Consistency Flow Matching for Robot Manipulation
[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations