Skip to content
View Leslie1103's full-sized avatar

Block or report Leslie1103

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

本人的科研经验

9,756 523 Updated Jan 10, 2026

Official implementation of "OpenCity3D: What do Vision-Language Models know about Urban Environments?" @ WACV2025

Jupyter Notebook 15 Updated Nov 24, 2024

[CVPR 2025] Code for Segment Any Motion in Videos

Jupyter Notebook 450 37 Updated Jun 10, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 53,127 6,201 Updated Sep 18, 2024

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,913 509 Updated Dec 13, 2025

assistant tools for attention visualization in deep learning

Jupyter Notebook 1,256 91 Updated Jun 9, 2022

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 7,387 738 Updated Jan 22, 2025

The official implementation of the paper “VGGT4D: Mining Motion Cues in Visual Geometry Transformers for 4D Scene Reconstruction.”

Python 159 9 Updated Dec 2, 2025

Code of π^3: Permutation-Equivariant Visual Geometry Learning

Python 1,541 87 Updated Jan 9, 2026

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 2,648 170 Updated Jan 8, 2026

[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Python 1,677 143 Updated Oct 7, 2025

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,936 841 Updated Jan 7, 2026

CUDA accelerated rasterization of gaussian splatting

Cuda 4,284 661 Updated Nov 18, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,179 1,291 Updated Oct 11, 2025

Bundler Structure from Motion Toolkit

C 1,575 480 Updated May 13, 2019

COLMAP - Structure-from-Motion and Multi-View Stereo

C++ 10,656 1,863 Updated Jan 11, 2026

[NeurIPS 2025] Official code for Reconstruct, Inpaint, Test-Time Finetune: Dynamic Novel-view Synthesis from Monocular Videos

Python 82 3 Updated Dec 30, 2025

A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.

Jupyter Notebook 2,646 264 Updated May 6, 2025
Python 1,211 88 Updated Aug 2, 2025

PyTorch tutorials.

Python 8,968 4,333 Updated Jan 8, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,902 31,692 Updated Jan 9, 2026

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Python 20,267 2,880 Updated Oct 17, 2025

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Jupyter Notebook 3,292 312 Updated Oct 27, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 41,263 3,224 Updated Jan 9, 2026

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,256 757 Updated Jan 10, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 13,502 1,601 Updated Dec 17, 2025