🎯
Focusing
  • SMART, Singapore
  • Singapore

Starred repositories

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Python 1,656 157 Updated Oct 20, 2025

[arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation

Jupyter Notebook 91 3 Updated Mar 1, 2025

Python package for retrieving current and historical photos from Google Street View

Python 483 135 Updated Aug 23, 2025

Download Google Street View panoramas efficiently.

Python 46 4 Updated Oct 2, 2025

The repo of Street View Image, Pose, and 3D Cities Dataset. Used in "Generic 3D Representation via Pose Estimation and Matching", ECCV16

474 66 Updated Jul 11, 2022

The best ChatGPT that $100 can buy.

Python 31,682 3,401 Updated Oct 22, 2025

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

607 15 Updated Oct 23, 2025

A collection of papers on World Models for autonomous driving (and robotics).

1,443 59 Updated Oct 14, 2025

Interpretable time series autoregression for periodicity quantification

Jupyter Notebook 43 4 Updated Oct 21, 2025

Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.

Python 685 56 Updated Sep 24, 2025

Repository of AudioX

Python 1,099 122 Updated Apr 30, 2025

Papers related to diffusion language models

1 Updated Jun 19, 2025

[NeurIPS'22] Tokenized Graph Transformer (TokenGT), in PyTorch

Python 342 50 Updated Apr 11, 2023

Training framework for Large Behavioral Models

Python 26 1 Updated Sep 17, 2025

Python 116 13 Updated Mar 1, 2024

[CVPR 2023] Query-Centric Trajectory Prediction

Python 714 100 Updated Oct 10, 2023

Geometric Latent Diffusion Models for 3D Molecule Generation

Python 258 48 Updated Jun 9, 2023

Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in PyTorch

Python 459 13 Updated Jan 12, 2025

Official PyTorch implementation of the paper "MotionCLIP: Exposing Human Motion Generation to CLIP Space"

Python 477 43 Updated Dec 18, 2023

CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control

Python 391 28 Updated May 2, 2025

Waymo Open Dataset

Python 3,085 679 Updated Jun 10, 2025

[NeurIPS 2024] SMART: Scalable Multi-agent Real-time Motion Generation via Next-token Prediction

Python 208 29 Updated Sep 30, 2025

A Unified Framework for scalable Vehicle Trajectory Prediction, ECCV 2024

Python 417 49 Updated Sep 29, 2025

Implementation of MapDiff: "Mask-prior-guided denoising diffusion improves inverse protein folding" in PyTorch

Python 39 10 Updated Jul 11, 2025

Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)

Python 127 1 Updated Oct 11, 2023

Recipe for a General, Powerful, Scalable Graph Transformer

Python 791 141 Updated Jul 4, 2024

Code for EMNLP22 SpaBERT: A Pretrained Language Model from Geographic Data for Geo-Entity Representation.

Python 21 8 Updated Jun 22, 2023