Skip to content
View Captain-Xiong's full-sized avatar

Block or report Captain-Xiong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

HTML 7,972 478 Updated Oct 1, 2025

One-shot and Few-shot 3D Editing without Per-Scene Optimization

160 11 Updated Aug 21, 2025

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 2,217 126 Updated Oct 24, 2025

OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling

Python 387 6 Updated Nov 11, 2025

Code of π^3: Permutation-Equivariant Visual Geometry Learning

Python 1,352 66 Updated Sep 10, 2025

OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model

Python 465 36 Updated Nov 9, 2025

A curated list of foundation models for vision and language tasks

1,116 55 Updated Jun 23, 2025

A Survey on Vision-Language Geo-Foundation Models (VLGFMs)

175 7 Updated May 24, 2025

[Pattern Recognition 25] CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks

Jupyter Notebook 447 27 Updated Mar 1, 2025

A comprehensive collection of IQA papers

TeX 1,378 82 Updated Oct 27, 2025

This is an unofficial implementation of the paper “PaDiM: a Patch Distribution Modeling Framework for Anomaly Detection and Localization”.

Python 454 106 Updated Nov 29, 2023

Virtual camera is created only using opencv and numpy. It simulates a camera where we can control all its parameters, intrinsic and extrinsic to get a better understanding how each component in the…

Python 250 59 Updated Jul 15, 2020

Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environments.

Python 724 99 Updated Oct 29, 2025

Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving (ICCV 2025)

Python 29 Updated May 29, 2025

[CVPR 2025] PoseTraj: Pose-Aware Trajectory Control in Video Diffusion

Python 21 1 Updated Oct 11, 2025

[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling

Python 4,080 322 Updated Sep 26, 2025

A 3DGS framework for omni urban scene reconstruction and simulation.

Python 1,008 110 Updated Aug 27, 2025

[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation

Python 501 16 Updated Jul 2, 2024

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 1,988 149 Updated Mar 13, 2025

[TPAMI 2025] Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving

Python 385 35 Updated Feb 18, 2025

Official code repo of ICLR'25 paper: MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations

Python 25 1 Updated Oct 15, 2025

深度学习经典、新论文逐段精读

31,907 2,741 Updated Mar 22, 2025

[CVPR 2025] UniScene: Unified Occupancy-centric Driving Scene Generation

Python 512 23 Updated Oct 30, 2025

Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird's-Eye-View, such as DETR3D, BEVDet, BEVFormer, BEVDepth, UniAD

1,079 110 Updated Apr 26, 2024

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

Python 669 32 Updated Sep 19, 2022

Visualizations for machine learning datasets

Jupyter Notebook 7,379 888 Updated May 24, 2023

An open source implementation of CLIP.

Python 12,922 1,195 Updated Nov 4, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,991 567 Updated Feb 26, 2025

A curated list of papers, datasets and resources pertaining to open vocabulary object detection.

375 23 Updated May 13, 2025
Next