Skip to content
View sayands's full-sized avatar
🎯
🎯

Organizations

@GradientSpaces

Block or report sayands

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.

Python 189 10 Updated Jun 4, 2025

[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.

Python 798 56 Updated Nov 10, 2025

Code and data for UniEgoMotion (ICCV 2025)

Python 39 3 Updated Nov 11, 2025

A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.

Python 693 27 Updated Jan 9, 2026

STI-Bench : Are MLLMs Ready for Precise Spatial-Temporal World Understanding?

Python 35 1 Updated Jul 7, 2025

Awesome 3D Scene Graphs: a curated list of 3D scene graph generation and related resources!

89 3 Updated Dec 18, 2024

[NeurIPS 2025] GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer

24 1 Updated Dec 1, 2025

[CVPR 2025, Highlight] CrossOver: 3D Scene Cross-Modal Alignment

Python 197 8 Updated Jan 4, 2026

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 12,109 1,070 Updated Oct 29, 2025

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 3,432 250 Updated Dec 3, 2024
Python 40 4 Updated Oct 26, 2025

🪄 Interactive Diagrams for Code

Python 930 75 Updated Jan 7, 2026

🚀 Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud ☁️ and edge 📱 deployment.

Python 346 2 Updated Dec 11, 2025

[NeurIPS 2025, Spotlight] Rectified Point Flow: Generic Point Cloud Pose Estimation

Python 164 12 Updated Dec 2, 2025

Code for "ReSpace: Text-Driven 3D Indoor Scene Synthesis and Editing with Preference Alignment"

Python 56 2 Updated Dec 9, 2025

[RSS 2025] ROMAN: a view-invariant global localization method that matches objects from different robot views for reliable pose estimation even when a scene is observed from opposite views

Python 271 16 Updated Dec 8, 2025
JavaScript 9 Updated Mar 24, 2025

A collection of onboarding diagrams of different project online

96 5 Updated Dec 1, 2025

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 11,534 1,073 Updated Nov 5, 2025

[ICCV 2023] SGAligner: 3D Scene Alignment with Scene Graphs

Python 109 9 Updated Oct 27, 2025

Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurIPS'24]

Jupyter Notebook 93 4 Updated Apr 29, 2024

[ICCV 2025 Oral] SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining

Python 287 14 Updated Dec 11, 2025

PyTorchGeoNodes is a PyTorch module for differentiable shape programs / procedural models in forms of graphs. It can automatically translate Blender geometry node models into PyTorch code. Original…

Python 43 1 Updated Nov 23, 2025

A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds

Python 29 7 Updated Jan 19, 2025

[CVPR 2025] WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments

Python 525 52 Updated Jan 6, 2026

Spurfies: Sparse Surface Reconstruction using Local Geometry Priors

Python 28 1 Updated Nov 6, 2024

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 3,959 291 Updated Jan 5, 2026

[ICCV 2025] HouseTour: A Virtual Real Estate A(I)gent

Python 21 Updated Oct 22, 2025

🟣 Computer Vision interview questions and answers to help you prepare for your next machine learning and data science interview in 2026.

104 18 Updated Jan 4, 2026
Next