XingruiWang

Follow

Xingrui Wang XingruiWang

Follow

CS PhD @ Johns Hopkins University Multimodal & Spatial reasoning

25 followers · 20 following

Johns Hopkins University
Baltimore, MD
19:46 (UTC -05:00)
https://xingruiwang.github.io/
@XingruiWang

Achievements

Achievements

Highlights

Pro

XingruiWang/README.md

Hi there, this is Xingrui. 👋

📫 How to reach me: [email protected]
😄 Main interest: AI, Computer Vision, Machine Learning ...
👯 More about me: https://xingruiwang.github.io/

Pinned Loading

Spatial457 Spatial457 Public

[CVPR'25] A vision question answering (VQA) benchmark for 6D spatial reasoning.

Python 15 2
open-compass/VLMEvalKit open-compass/VLMEvalKit Public

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3.4k 550
KeyVID KeyVID Public

Offical code of paper KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation.

Python 5
XModBench XModBench Public

XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models

Python 3
DynSuperCLEVR DynSuperCLEVR Public

A video question answering dataset that focuses on the dynamics properties of objects (velocity, acceleration) and their collisions within 4D scenes.

Python 18
3D-Aware-VQA 3D-Aware-VQA Public

Official Code for the NeurIPS'23 paper "3D-Aware Visual Question Answering about Parts, Poses and Occlusions"

Jupyter Notebook 19