
Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think

Python 656 37 Updated Jan 2, 2026

Gym Interface Wrapper for Simulink Models

Python 23 2 Updated Feb 14, 2025

Explore the Multimodal “Aha Moment” on 2B Model

Python 620 23 Updated Mar 18, 2025

Suite of PyBullet reinforcement learning environments targeted towards using tactile data as the main form of observation.

Python 171 24 Updated Jan 25, 2023

Muon is Scalable for LLM Training

1,391 78 Updated Aug 3, 2025

Header-Only C++ Library for Graph Representation and Algorithms

C++ 665 139 Updated Dec 22, 2025

"Sequential Dexterity: Chaining Dexterous Policies for Long-Horizon Manipulation" code repository

Python 173 16 Updated Apr 25, 2024

A generative and self-guided robotic agent that endlessly proposes and masters new skills.

Python 1,123 105 Updated May 31, 2024

Lightning-UQ-Box: Uncertainty Quantification for Neural Networks with PyTorch and Lightning

Python 211 23 Updated Dec 15, 2025

Inference-time scaling of diffusion-based image and video generation models.

Python 172 11 Updated Dec 17, 2025

[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,697 189 Updated Dec 16, 2025

Official implementation of SwiftSketch

Jupyter Notebook 215 6 Updated Sep 27, 2025

robosuite: A Modular Simulation Framework and Benchmark for Robot Learning

Python 2,123 634 Updated Dec 31, 2025

[ICLR 2025] Diffusion Feedback Helps CLIP See Better

Python 300 15 Updated Jan 23, 2025

A contact solver for physics-based simulations involving 👚 shells, 🪵 solids and 🪢 rods.

Python 1,554 88 Updated Dec 29, 2025

Witness the aha moment of VLM with less than $3.

Python 4,016 289 Updated May 19, 2025

[CVPR 2025] Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation

Python 172 12 Updated Jun 20, 2025
Python 427 28 Updated Jun 12, 2025

Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster

Python 71 Updated May 18, 2025

"4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency", Yuyang Yin*, Dejia Xu*, Zhangyang Wang, Yao Zhao, Yunchao Wei

Python 246 12 Updated Jun 24, 2024

Official implementation of Diffusion Policy Policy Optimization, arXiv 2024

Python 722 89 Updated Feb 4, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,556 1,540 Updated Apr 24, 2025
Jupyter Notebook 174 22 Updated Dec 31, 2025

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,292 107 Updated Dec 15, 2025

Assistive Gym, a physics-based simulation framework for physical human-robot interaction and robotic assistance.

Python 389 85 Updated Jan 26, 2024

Simple RL training for reasoning

Python 3,817 282 Updated Dec 23, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,658 2,236 Updated Feb 1, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Jupyter Notebook 2,468 197 Updated Dec 3, 2025

[ICLR 2025] Official implementation of "DiffSplat: Repurposing Image Diffusion Models for Scalable 3D Gaussian Splat Generation".

Python 460 27 Updated Aug 27, 2025