Stars
Official implementation of IROS 2025 paper Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline
Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets
G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
Thinking in 360°: Humanoid Visual Search in the Wild
GigaWorld-0: World Models as Data Engine to Empower Embodied AI
WorldSplat: Gaussian-Centric Feed-Forward 4D Scene Generation for Autonomous Driving
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation
Official PyTorch Implementation of "Flow Map Distillation Without Data"
[NeurIPS 2025] PyTorch implementation of ThinkSound, a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.
Official repository for “DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation”
HunyuanVideo-1.5: A leading lightweight video generation model
Action-Guided Knowledge Distillation for VLA Models
Muskie: Multi-view Masked Image Modeling for 3D Vision Pre-training
UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios
Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model
ReconViaGen: Towards Accurate Multi-view 3D Object Reconstruction via Generation
MuM's a pretty good feature extractor for 3D tasks, probably the best.
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
Official Repository of POMA-3D: The Point Map Way to 3D Scene Understanding.
NaTex: Seamless Texture Generation as Latent Color Diffusion
Official PyTorch Implementation for "Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising"
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
The official repo of Qwen2-Audio, a chat and pretrained large audio-language model proposed by Alibaba Cloud.