saidwivedi

Sai Kumar Dwivedi saidwivedi

Research Scientist Intern at Meta | PhD Candidate at MPI-IS | Ex: Mercedes-Benz, Intel

88 followers · 188 following

Max Planck Institute for Intelligent Systems
Germany
https://saidwivedi.in
@saidwivedi
in/saidwivedi
@saidwivedi.in

Stars

Tongyi-MAI / Z-Image

2,347 108 Updated Nov 29, 2025

facebookresearch / sam3

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 4,699 434 Updated Nov 29, 2025

facebookresearch / sam-3d-objects

SAM 3D Objects

Python 4,157 317 Updated Nov 21, 2025

baaivision / UniVLA

Unified Vision-Language-Action Model

Python 243 17 Updated Oct 15, 2025

EricWang12 / PartUV

[SIGGRAPH ASIA 2025] Code for PartUV: Part-Based UV Unwrapping of 3D Meshes

C++ 97 10 Updated Nov 21, 2025

kandinskylab / kandinsky-5

Kandinsky 5.0: A family of diffusion models for Video & Image generation

Python 494 28 Updated Nov 28, 2025

facebookresearch / sam-3d-body

The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…

Python 1,945 145 Updated Nov 25, 2025

facebookresearch / MHR

Momentum Human Rig is an anatomically-inspired parametric full-body digital human model developed at Meta. It includes: A parametric body skeletal model; A realistic 3D mesh skinned to the skeleton…

Jupyter Notebook 417 20 Updated Nov 27, 2025

ByteDance-Seed / Depth-Anything-3

Depth Anything 3

Jupyter Notebook 2,998 228 Updated Nov 28, 2025

potree / potree

WebGL point cloud viewer for large datasets

JavaScript 5,195 1,292 Updated Aug 24, 2024

zbw001 / TAPIP3D

TAPIP3D: Tracking Any Point in Persistent 3D Geometry

Python 328 21 Updated Sep 27, 2025

AiEson / Part-X-MLLM

Part-X-MLLM: Part-aware 3D Multimodal Large Language Model

90 3 Updated Nov 28, 2025

mll-lab-nu / Awesome-Spatial-Intelligence-in-VLM

A paper list for spatial reasoning

456 29 Updated Nov 27, 2025

tum-vision / flowfeat

FlowFeat: Pixel-Dense Embedding of Motion Profiles (NeurIPS 2025 Spotlight)

Python 54 5 Updated Nov 25, 2025

jingyugong / SSOMotion

Official Implementation of Human Motion Synthesis in 3D Scenes via Unified Scene Semantic Occupancy (AAAI2026)

3 1 Updated Nov 10, 2025

Axellwppr / gentle-humanoid

Python 69 1 Updated Nov 7, 2025

YihongSun / TubeletGraph

[NeurIPS 2025] Tracking and Understanding Object Transformations

Jupyter Notebook 180 23 Updated Nov 18, 2025

snap-research / SnapMoGen

SnapMoGen: Human Motion Generation from Expressive Texts [NeurIPS 2025]

Python 77 8 Updated Sep 26, 2025

RockeyCoss / Prompt-Segment-Anything

This is an implementation of zero-shot instance segmentation using Segment Anything.

Python 314 14 Updated Apr 14, 2023

Jiaxin-Lu / humoto

[ICCV 2025] HUMOTO Dataset Code Release

Python 38 1 Updated Nov 6, 2025

naver / anny

Anny, A Free and Interpretable Human Body Model for all ages, written in PyTorch.

Python 289 18 Updated Nov 21, 2025

ptrvilya / tridi

[ICCV'25] Method for generating static human-object interactions

Python 26 Updated Oct 28, 2025

krea-ai / realtime-video

Krea Realtime 14B. An open-source realtime AI video model.

Python 399 22 Updated Nov 13, 2025

knightnemo / Awesome-World-Models

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

1,023 32 Updated Nov 29, 2025

DAVIAN-Robotics / PHUMA

Code for "PHUMA: Physically-Grounded Humanoid Locomotion Dataset"

Python 146 7 Updated Nov 11, 2025

pixeli99 / SVD_Xtend

Stable Video Diffusion Training Code and Extensions.

Python 723 74 Updated Jul 25, 2024

VDIGPKU / EA3D

29 Updated Nov 17, 2025

microsoft / VITRA

VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos

114 2 Updated Oct 28, 2025

Tencent-Hunyuan / HunyuanWorld-Mirror

Fast and Universal 3D reconstruction model for versatile tasks

Python 847 64 Updated Nov 27, 2025

elisabettafedele / superdec

[ICCV 2025] SuperDec: 3D Scene Decomposition with Superquadric Primitives.

Python 153 7 Updated Nov 19, 2025

Lists (32)

2D / 3D Keypoints

3D-Avatar-NonParam

3D from Image/Video

3D from Text

3D + Language

Architecture

Curation List

Datasets

Depth Estimation

Digital Human <-> Robotics

Hand Mesh Recovery

HSI-Generation

Human Body Mesh

Human Motion

Human-Object-Interaction

Human-Object-Reconstruction-3D

Human Parsing

Human-Scene-Interaction

Image Generation

Inpainting / EditAnything

Large Scale Foundation Model

Misc

NeRF / SDF / Implicit

Object-6DOF

Object Detection

Object Tracking

Pose Embedding

Segmentation

Tools

Video + Language

Vision Embedding

Vision + Language
