Skip to content
View zrporz's full-sized avatar

Block or report zrporz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Generative Models by Stability AI

Python 26,636 2,988 Updated Nov 3, 2025

The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning."

Python 33 1 Updated Nov 17, 2025

[Nature Machine Intelligence 2025] Emulating Human-like Adaptive Vision for Efficient and Flexible Machine Visual Perception

Python 96 3 Updated Nov 9, 2025
Python 86 Updated Nov 7, 2025

Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"

Python 119 7 Updated Feb 14, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 16,448 2,624 Updated Nov 24, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,020 2,666 Updated Aug 12, 2024

[NeurIPS 2025] LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS

Python 143 9 Updated Oct 17, 2025

The code repository of UniRL

Python 46 3 Updated May 30, 2025

[NeurIPS 2025] MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,500 74 Updated Nov 16, 2025

SC-Depth (V1, V2, and V3) for Unsupervised Monocular Depth Estimation Webpage:https://jiawangbian.github.io/sc_depth_pl/

Python 477 73 Updated Oct 6, 2023

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 2,003 151 Updated Mar 13, 2025

Dense Prediction Transformers

Python 2,274 283 Updated Dec 18, 2024

[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 2,007 129 Updated Nov 2, 2025

(ICLR 2025 spotlight) "Poison-splat: Computation Cost Attack on 3D Gaussian Splatting"

C++ 67 3 Updated Feb 13, 2025

CODA: Repurposing Continuous VAEs for Discrete Tokenization

Python 34 1 Updated Jul 4, 2025

A curated list of recent diffusion models for video generation, editing, and various other applications.

5,211 320 Updated Oct 15, 2025

Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

Python 298 15 Updated Mar 26, 2025

LaTeX Thesis Template for Tsinghua University

TeX 5,038 1,127 Updated Oct 19, 2025

[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,621 181 Updated Oct 27, 2025

Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025)

Python 157 10 Updated Oct 10, 2025

DUSt3R: Geometric 3D Vision Made Easy

Python 6,745 716 Updated Sep 24, 2025

[CVPR 2025] Official repo for ART:Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation

Jupyter Notebook 352 38 Updated Aug 6, 2025

Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image

Python 280 22 Updated Jun 2, 2025

Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch

Python 475 13 Updated Jan 12, 2025

Official inference repo for FLUX.1 models

Python 24,693 1,819 Updated Jul 31, 2025

[TPAMI 2023] SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections

Python 651 44 Updated Aug 14, 2024

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention

Python 912 83 Updated Apr 17, 2024
Python 46 3 Updated Jan 2, 2025

Official implementation of Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion

Python 223 13 Updated Jun 12, 2024
Next