Stars
A ROS driver for Insta360 cameras, enabling real-time image capture, processing, and publishing in ROS environments.
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
🔥(CVPR 2025 Highlight) Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera
GPU-optimized version of the MuJoCo physics simulator, designed for NVIDIA hardware.
Running VLA at 30Hz frame rate and 480Hz trajectory frequency
Inverse kinematics and subproblem solutions from "IK-Geo: Unified robot inverse kinematics using subproblem decomposition" implemented in MATLAB, C++, Rust, and Python.
MimicLabs: A Scalable Data Collection & Generation Pipeline for Table-top Manipulation
Code and model to control the AH!
HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos, CVPR 2025
Codebase for DexMachina: Functional Retargeting for Bimanual Dexterous Manipulation
moojink / openvla-oft
Forked from openvla/openvlaFine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
[CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching
Official implementation for the paper "Full-Order Sampling-Based MPC for Torque-Level Locomotion Control via Diffusion-Style Annealing". DIAL-MPC is a novel sampling-based MPC framework for legged …
Accessible large language models via k-bit quantization for PyTorch.
[CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
Public code release associated with SceneScript.
Official implementation for the paper "Model-based Diffusion for Trajectory Optimization". Model-based diffusion (MBD) is a novel diffusion-based trajectory optimization framework that employs a dy…
The unitree_il_lerobot open-source project is a modification of the LeRobot open-source training framework, enabling the training and testing of data collected using the dual-arm dexterous hands of…
GTSAM is a library of C++ classes that implement smoothing and mapping (SAM) in robotics and vision, using factor graphs and Bayes networks as the underlying computing paradigm rather than sparse m…
An open-source library for GPU-accelerated robot learning and sim-to-real transfer.
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)
A suite of image and video neural tokenizers