jxxtin

🎯

Focusing

jxxtin

🎯

Focusing

7 followers · 104 following

Stars

CodeGoat24 / UnifiedReward

Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think

Python 656 37 Updated Jan 2, 2026

johbrust / simulink_gym

Gym Interface Wrapper for Simulink Models

Python 23 2 Updated Feb 14, 2025

turningpoint-ai / VisualThinker-R1-Zero

Explore the Multimodal “Aha Moment” on 2B Model

Python 620 23 Updated Mar 18, 2025

ac-93 / tactile_gym

Suite of PyBullet reinforcement learning environments targeted towards using tactile data as the main form of observation.

Python 171 24 Updated Jan 25, 2023

MoonshotAI / Moonlight

Muon is Scalable for LLM Training

1,391 78 Updated Aug 3, 2025

ZigRazor / CXXGraph

Header-Only C++ Library for Graph Representation and Algorithms

C++ 665 139 Updated Dec 22, 2025

sequential-dexterity / SeqDex

"Sequential Dexterity: Chaining Dexterous Policies for Long-Horizon Manipulation" code repository

Python 173 16 Updated Apr 25, 2024

Genesis-Embodied-AI / RoboGen

A generative and self-guided robotic agent that endlessly propose and master new skills.

Python 1,123 105 Updated May 31, 2024

lightning-uq-box / lightning-uq-box

Lightning-UQ-Box: Uncertainty Quantification for Neural Networks with PyTorch and Lightning

Python 211 23 Updated Dec 15, 2025

sayakpaul / tt-scale-flux

Inference-time scaling of diffusion-based image and video generation models.

Python 172 11 Updated Dec 17, 2025

OpenDriveLab / AgiBot-World

[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,697 189 Updated Dec 16, 2025

swiftsketch / SwiftSketch

Official implementation of SwiftSketch

Jupyter Notebook 215 6 Updated Sep 27, 2025

ARISE-Initiative / robosuite

robosuite: A Modular Simulation Framework and Benchmark for Robot Learning

Python 2,123 634 Updated Dec 31, 2025

baaivision / DIVA

[ICLR 2025] Diffusion Feedback Helps CLIP See Better

Python 300 15 Updated Jan 23, 2025

st-tech / ppf-contact-solver

A contact solver for physics-based simulations involving 👚 shells, 🪵 solids and 🪢 rods.

Python 1,554 88 Updated Dec 29, 2025

StarsfieldAI / R1-V

Witness the aha moment of VLM with less than $3.

Python 4,016 289 Updated May 19, 2025

PKU-HMI-Lab / LIFT3D

[CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation

Python 172 12 Updated Jun 20, 2025

Physical-Intelligence / openpi

Python 9,620 1,296 Updated Dec 27, 2025

LeCAR-Lab / HumanoidVerse

Python 427 28 Updated Jun 12, 2025

lucidrains / transformer-directed-evolution

Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster

Python 71 Updated May 18, 2025

VITA-Group / 4DGen

"4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency", Yuyang Yin*, Dejia Xu*, Zhangyang Wang, Yao Zhao, Yunchao Wei

Python 246 12 Updated Jun 24, 2024

irom-princeton / dppo

Official implementation of Diffusion Policy Policy Optimization, arxiv 2024

Python 722 89 Updated Feb 4, 2025

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 12,556 1,540 Updated Apr 24, 2025

kunifujiwara / VoxCity

Jupyter Notebook 174 22 Updated Dec 31, 2025

open-thought / reasoning-gym

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,292 107 Updated Dec 15, 2025

Healthcare-Robotics / assistive-gym

Assistive Gym, a physics-based simulation framework for physical human-robot interaction and robotic assistance.

Python 389 85 Updated Jan 26, 2024

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,817 282 Updated Dec 23, 2025

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,658 2,236 Updated Feb 1, 2025

mll-lab-nu / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Jupyter Notebook 2,468 197 Updated Dec 3, 2025

chenguolin / DiffSplat

[ICLR 2025] Official implementation of "DiffSplat: Repurposing Image Diffusion Models for Scalable 3D Gaussian Splat Generation".

Python 460 27 Updated Aug 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly