Sayan Deb Sarkar sayands

🎯

Computer Vision PhD @ Stanford | CS MSc @ ETH Zurich | 3D Scene Understanding

338 followers · 14 following

Stanford
Stanford, CA
sayands.github.io
@debsarkar_sayan
@sayandsarkar.bsky.social

Stars

LaVi-Lab / Video-3D-LLM

[CVPR 2025] Code for the paper "Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding".

Python 189 10 Updated Jun 4, 2025

nianticlabs / acezero

[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.

Python 798 56 Updated Nov 10, 2025

chaitanya100100 / UniEgoMotion

Code and data for UniEgoMotion (ICCV 2025)

Python 39 3 Updated Nov 11, 2025

EvolvingLMMs-Lab / lmms-engine

A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.

Python 693 27 Updated Jan 9, 2026

MINT-SJTU / STI-Bench

STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?

Python 35 1 Updated Jul 7, 2025

DennisRotondi / awesome-3D-scene-graphs

Awesome 3D Scene Graphs: a curated list of 3D scene graph generation and related resources!

89 3 Updated Dec 18, 2024

GradientSpaces / GuideFlow3D

[NeurIPS 2025] GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer

24 1 Updated Dec 1, 2025

GradientSpaces / CrossOver

[CVPR 2025, Highlight] CrossOver: 3D Scene Cross-Modal Alignment

Python 197 8 Updated Jan 4, 2026

PKU-YuanGroup / Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python 12,109 1,070 Updated Oct 29, 2025

PKU-YuanGroup / Video-LLaVA

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 3,432 250 Updated Dec 3, 2024

gaiadilorenzo / object-x

Python 40 4 Updated Oct 26, 2025

CodeBoarding / CodeBoarding

🪄 Interactive Diagrams for Code

Python 930 75 Updated Jan 7, 2026

FocoosAI / focoos

🚀 Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud ☁️ and edge 📱 deployment.

Python 346 2 Updated Dec 11, 2025

GradientSpaces / Rectified-Point-Flow

[NeurIPS 2025, Spotlight] Rectified Point Flow: Generic Point Cloud Pose Estimation

Python 164 12 Updated Dec 2, 2025

GradientSpaces / respace

Code for "ReSpace: Text-Driven 3D Indoor Scene Synthesis and Editing with Preference Alignment"

Python 56 2 Updated Dec 9, 2025

mit-acl / roman

[RSS 2025] ROMAN: a view-invariant global localization method that matches objects from different robot views for reliable pose estimation even when a scene is observed from opposite views

Python 271 16 Updated Dec 8, 2025

DanaCohen95 / TriTex

JavaScript 9 Updated Mar 24, 2025

CodeBoarding / GeneratedOnBoardings

A collection of onboarding diagrams for different projects online

96 5 Updated Dec 1, 2025

microsoft / TRELLIS

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 11,534 1,073 Updated Nov 5, 2025

sayands / sgaligner

[ICCV 2023] SGAligner: 3D Scene Alignment with Scene Graphs

Python 109 9 Updated Oct 27, 2025

bethgelab / frequency_determines_performance

Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurIPS'24]

Jupyter Notebook 93 4 Updated Apr 29, 2024

unique1i / SceneSplat

[ICCV 2025 Oral] SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining

Python 287 14 Updated Dec 11, 2025

vevenom / pytorchgeonodes

PyTorchGeoNodes is a PyTorch module for differentiable shape programs / procedural models in forms of graphs. It can automatically translate Blender geometry node models into PyTorch code. Original…

Python 43 1 Updated Nov 23, 2025

oliver-lemke / spot-compose

A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds

Python 29 7 Updated Jan 19, 2025

GradientSpaces / WildGS-SLAM

[CVPR 2025] WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments

Python 525 52 Updated Jan 6, 2026

sayands / slangpy-ml

Forked from JitongZ/copy-of-slangpy-neuralnetwork

Python 1 Updated Mar 18, 2025

kevinYitshak / spurfies

Spurfies: Sparse Surface Reconstruction using Local Geometry Priors

Python 28 1 Updated Nov 6, 2024

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 3,959 291 Updated Jan 5, 2026

GradientSpaces / HouseTour

[ICCV 2025] HouseTour: A Virtual Real Estate A(I)gent

Python 21 Updated Oct 22, 2025

Devinterview-io / computer-vision-interview-questions

🟣 Computer Vision interview questions and answers to help you prepare for your next machine learning and data science interview in 2026.

104 18 Updated Jan 4, 2026
