Learning & Coding



Starred repositories


The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Jupyter Notebook 193 9 Updated Sep 19, 2025

Official inference repo for FLUX.2 models

Python 834 30 Updated Nov 26, 2025

Video Content Customization Using First Frame

Python 97 1 Updated Nov 26, 2025

IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction

Python 271 7 Updated Nov 21, 2025

HunyuanVideo-1.5: A leading lightweight video generation model

Python 861 65 Updated Nov 28, 2025

Kandinsky 5.0: A family of diffusion models for Video & Image generation

Python 483 26 Updated Nov 28, 2025

[SIGGRAPH ASIA 2025] Code for PartUV: Part-Based UV Unwrapping of 3D Meshes

C++ 96 9 Updated Nov 21, 2025

The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…

Python 1,921 141 Updated Nov 25, 2025

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 4,602 411 Updated Nov 25, 2025

SAM 3D Objects

Python 4,067 304 Updated Nov 21, 2025

Official Implementation of "MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation"

Python 262 8 Updated Nov 19, 2025

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 1,426 64 Updated Nov 18, 2025

Depth Anything 3

Jupyter Notebook 2,969 225 Updated Nov 28, 2025

Scaling Spatial Intelligence with Multimodal Foundation Models

Python 117 7 Updated Nov 21, 2025

Visual Spatial Tuning

Jupyter Notebook 146 6 Updated Nov 16, 2025

Cambrian-S: Towards Spatial Supersensing in Video

Python 396 11 Updated Nov 10, 2025

[NeurIPS 2025 Oral] Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation

Python 592 21 Updated Nov 27, 2025

An unofficial and simplified implementation of the SIGGRAPH 2025 best paper nominee CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image (work in progress)

Python 157 5 Updated Nov 9, 2025

ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation

Python 607 34 Updated Nov 20, 2025

A part-based 3D generation framework & the largest and most comprehensively annotated 3D part dataset.

Jupyter Notebook 105 2 Updated Nov 24, 2025

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,344 91 Updated Oct 16, 2025

Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time

Python 252 10 Updated Oct 31, 2025

Native Multimodal Models are World Learners

Python 1,294 46 Updated Nov 28, 2025

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction (ICCV 2025)

Python 646 32 Updated Nov 24, 2025

"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"

Python 1,217 207 Updated Nov 16, 2025

L4P -- a feed-forward foundational model designed for multiple low-level 4D vision perception tasks.

Python 35 Updated Sep 22, 2025

[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention

Python 587 30 Updated Nov 18, 2025

A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.

Python 660 24 Updated Nov 28, 2025

[NeurIPS DB 2025] PartNeXt: A Next-Generation Dataset for Fine-Grained and Hierarchical 3D Part Understanding

Python 83 1 Updated Nov 4, 2025