chenguolin

👶

Learning & Coding

Chenguo Lin chenguolin

👶

Learning & Coding

95 followers · 91 following

Peking University
Beijing, China
23:41 (UTC +08:00)
https://chenguolin.github.io
@lin_chenguo

Achievements

Highlights

Starred repositories

qiuzh20 / gated_attention

The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Jupyter Notebook 193 9 Updated Sep 19, 2025

Tongyi-MAI / Z-Image

1,721 62 Updated Nov 28, 2025

black-forest-labs / flux2

Official inference repo for FLUX.2 models

Python 834 30 Updated Nov 26, 2025

zli12321 / FFGO-Video-Customization

Video Content Customization Using First Frame

Python 97 1 Updated Nov 26, 2025

lifuguan / IGGT_official

IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction

Python 271 7 Updated Nov 21, 2025

Tencent-Hunyuan / HunyuanVideo-1.5

HunyuanVideo-1.5: A leading lightweight video generation model

Python 861 65 Updated Nov 28, 2025

kandinskylab / kandinsky-5

Kandinsky 5.0: A family of diffusion models for Video & Image generation

Python 483 26 Updated Nov 28, 2025

EricWang12 / PartUV

[SIGGRAPH ASIA 2025] Code for PartUV: Part-Based UV Unwrapping of 3D Meshes

C++ 96 9 Updated Nov 21, 2025

facebookresearch / sam-3d-body

The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…

Python 1,921 141 Updated Nov 25, 2025

facebookresearch / sam3

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 4,602 411 Updated Nov 25, 2025

facebookresearch / sam-3d-objects

SAM 3D Objects

Python 4,067 304 Updated Nov 21, 2025

tyfeld / MMaDA-Parallel

Official Implementation of "MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation"

Python 262 8 Updated Nov 19, 2025

LTH14 / JiT

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 1,426 64 Updated Nov 18, 2025

ByteDance-Seed / Depth-Anything-3

Depth Anything 3

Jupyter Notebook 2,969 225 Updated Nov 28, 2025

OpenSenseNova / SenseNova-SI

Scaling Spatial Intelligence with Multimodal Foundation Models

Python 117 7 Updated Nov 21, 2025

Yangr116 / VST

Visual Spatial Tuning

Jupyter Notebook 146 6 Updated Nov 16, 2025

cambrian-mllm / cambrian-s

Cambrian-S: Towards Spatial Supersensing in Video

Python 396 11 Updated Nov 10, 2025

FoundationVision / InfinityStar

[NeurIPS 2025 Oral]Infinity⭐️: Uniﬁed Spacetime AutoRegressive Modeling for Visual Generation

Python 592 21 Updated Nov 27, 2025

FishWoWater / CAST

An unofficial and simplified implementation of SIGGRAPH 2025 best paper nominate: CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image, working in progress

Python 157 5 Updated Nov 9, 2025

nv-tlabs / ChronoEdit

ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation

Python 607 34 Updated Nov 20, 2025

hkdsc / fullpart

A part-based 3D generation framework & the largest and most comprehensively annotated 3D part dataset.

Jupyter Notebook 105 2 Updated Nov 24, 2025

Vchitect / VBench

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,344 91 Updated Oct 16, 2025

TencentARC / RollingForcing

Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time

Python 252 10 Updated Oct 31, 2025

baaivision / Emu3.5

Native Multimodal Models are World Learners

Python 1,294 46 Updated Nov 28, 2025

caiyuanhao1998 / Open-DiffusionGS

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction (ICCV 2025)

Python 646 32 Updated Nov 24, 2025

HKUDS / ViMax

"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"

Python 1,217 207 Updated Nov 16, 2025

NVlabs / L4P

L4P -- a feed-forward foundational model designed for multiple low-level 4D vision perception tasks.

Python 35 Updated Sep 22, 2025

svg-project / Sparse-VideoGen

[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention

Python 587 30 Updated Nov 18, 2025

EvolvingLMMs-Lab / lmms-engine

A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.

Python 660 24 Updated Nov 28, 2025

AuthorityWang / PartNeXt

[Neurips DB 2025] PartNeXt: A Next-Generation Dataset for Fine-Grained and Hierarchical 3D Part Understanding

Python 83 1 Updated Nov 4, 2025

Chenguo Lin chenguolin

Highlights

Starred repositories

Awesome Lists