Captain-Xiong

Yu Chen Captain-Xiong

11 followers · 0 following

Lists (14)

Sort

Stars

MrNeRF / awesome-3D-gaussian-splatting

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

HTML 7,972 478 Updated Oct 1, 2025

aim-uofa / Tinker

One-shot and Few-shot 3D Editing without Per-Scene Optimization

160 11 Updated Aug 21, 2025

facebookresearch / map-anything

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 2,217 126 Updated Oct 24, 2025

yangzhou24 / OmniWorld

OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling

Python 387 6 Updated Nov 11, 2025

yyfz / Pi3

Code of π^3: Permutation-Equivariant Visual Geometry Learning

Python 1,352 66 Updated Sep 10, 2025

DriveVLA / OpenDriveVLA

OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model

Python 465 36 Updated Nov 9, 2025

uncbiag / Awesome-Foundation-Models

A curated list of foundation models for vision and language tasks

1,116 55 Updated Jun 23, 2025

zytx121 / Awesome-VLGFM

A Survey on Vision-Language Geo-Foundation Models (VLGFMs)

175 7 Updated May 24, 2025

xmed-lab / CLIP_Surgery

[Pattern Recognition 25] CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks

Jupyter Notebook 447 27 Updated Mar 1, 2025

chaofengc / Awesome-Image-Quality-Assessment

A comprehensive collection of IQA papers

TeX 1,378 82 Updated Oct 27, 2025

amazon-science / patchcore-inspection

Python 1,046 202 Updated Jul 10, 2024

xiahaifeng1995 / PaDiM-Anomaly-Detection-Localization-master

This is an unofficial implementation of the paper “PaDiM: a Patch Distribution Modeling Framework for Anomaly Detection and Localization”.

Python 454 106 Updated Nov 29, 2023

kaustubh-sadekar / VirtualCam

Virtual camera is created only using opencv and numpy. It simulates a camera where we can control all its parameters, intrinsic and extrinsic to get a better understanding how each component in the…

Python 250 59 Updated Jul 15, 2020

nvidia-cosmos / cosmos-transfer1

Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environments.

Python 724 99 Updated Oct 29, 2025

Depth2World / VLADBench

Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving (ICCV 2025)

Python 29 Updated May 29, 2025

robingg1 / PoseTraj

[CVPR 2025] PoseTraj: Pose-Aware Trajectory Control in Video Diffusion

Python 21 1 Updated Oct 11, 2025

manycore-research / SpatialLM

[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling

Python 4,080 322 Updated Sep 26, 2025

ziyc / drivestudio

A 3DGS framework for omni urban scene reconstruction and simulation.

Python 1,008 110 Updated Aug 27, 2025

showlab / DragAnything

[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation

Python 501 16 Updated Jul 2, 2024

YvanYin / Metric3D

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 1,988 149 Updated Mar 13, 2025