Vincentqyw

Keep Your Curiosity.

Realcat Vincentqyw

Keep Your Curiosity.

⭐️Focusing on Visual Localization, SfM and SLAM.

717 followers · 326 following

THU
20:20 (UTC +08:00)
@AlphaRealcat

Achievements

x2 x3

Achievements

x2 x3

Lists (28)

Sort

Starred repositories

JiahaoPlus / EvoWorld

EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory

Python 35 Updated Oct 2, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 10,810 972 Updated Oct 13, 2025

93won / lightweight_vio

Lightweight Stereo VIO

C++ 220 27 Updated Oct 14, 2025

msilaev / VGGT-Long-Gsplat

Forked from DengKaiCQ/VGGT-Long

Fork of VGGT-Long with Colmap sparse model for Gaussian splatting

Python 9 Updated Oct 9, 2025

gangweix / pixel-perfect-depth

[NeurIPS 2025] Pixel-Perfect Depth

Python 462 13 Updated Oct 13, 2025

uzh-rpg / hdvio2.0

C++ 28 1 Updated Aug 31, 2025

VladimirYugay / VoT

Visual Odometry with Transformers

47 1 Updated Oct 5, 2025

hslr-s / sun-panel

A server, NAS navigation panel, Homepage, browser homepage. | 一个服务器、NAS导航面板、Homepage、浏览器首页。

Vue 4,511 506 Updated Aug 10, 2025

Open-Dev-Society / OpenStock

OpenStock is an open-source alternative to expensive market platforms. Track real-time prices, set personalized alerts, and explore detailed company insights — built openly, for everyone, forever f…

TypeScript 1,861 215 Updated Oct 12, 2025

EnVision-Research / DA-2

Official Implementation of DA^2: Depth Anything in Any Direction

Python 157 14 Updated Oct 12, 2025

wlfeng0509 / QuantVGGT

Code for QuantVGGT: Quantized Visual Geometry Grounded Transformer

Python 66 2 Updated Oct 10, 2025

Inception3D / TTT3R

A simple state update rule to enhance length generalization for CUT3R

Python 420 8 Updated Oct 1, 2025

cvg / lamaria

Benchmarking Visual-Inertial SLAM at City Scale (ICCV 2025).

Python 92 3 Updated Oct 14, 2025

google-deepmind / romo

Python 14 Updated Sep 25, 2025

microsoft / ml-fundamentals

A repository containing useful resources to learn Machine Learning Fundamentals

Jupyter Notebook 3 Updated Oct 12, 2025

bytedance / Sa2VA

🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Python 1,297 82 Updated Sep 8, 2025

jcliu0428 / ZeroPlane

[CVPR 2025] Towards In-the-wild 3D Plane Reconstruction from a Single Image

Python 62 Updated Oct 6, 2025

GREAT-WHU / MASt3R-Fusion

Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM.

126 4 Updated Oct 1, 2025

nv-tlabs / lyra

Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation

Python 505 26 Updated Oct 2, 2025

facebookresearch / cwm

Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.

Python 668 54 Updated Sep 24, 2025

cvg / ActLoc

[CoRL 2025] ActLoc: Learning to Localize on the Move via Active Viewpoint Selection

Python 56 1 Updated Sep 21, 2025

tobiasfshr / flowr

FlowR: Flowing from Sparse to Dense 3D Reconstructions (ICCV'25 Highlight)

23 Updated Sep 20, 2025

Gar-b-age / CookLikeHOC

🥢像老乡鸡🐔那样做饭。主要部分于2024年完工，非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》，并做归纳、编辑与整理。CookLikeHOC.

JavaScript 21,032 2,089 Updated Sep 25, 2025

QwenLM / Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,704 290 Updated Jun 12, 2025

jmanhype / vggt-mps

VGGT 3D Vision Agent optimized for Apple Silicon with Metal Performance Shaders

Python 68 7 Updated Sep 19, 2025

facebookresearch / map-anything

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 1,964 106 Updated Oct 9, 2025

qpsolvers / qpsolvers

Quadratic programming solvers in Python with a unified API

Python 695 98 Updated Aug 7, 2025

NVlabs / Mosaic3D

Python 42 1 Updated Sep 22, 2025

sentient-agi / ROMA

Recursive-Open-Meta-Agent v0.1 (Beta). A meta-agent framework to build high-performance multi-agent systems.

Realcat Vincentqyw

Lists (28)

3D Recon

Awesome AGI

Datasets

Depth

GAN

Image Matching

Image Retrieval

Keypoint Detection

Light Field Depth Estimation

LLM

Loss

NAS

NeRF

Optical Flow

Parsing

Plots

Resume

SAM

SLAM

Style Transfer

System design

TK

Tools

tooth

🥇 Top Stars

Tracking

Visual Localization

WebDev

Starred repositories

depth-completion

deep-features

feature-extraction

slam-algorithms

slam

3D

cvpr2019