-
University of Maryland, College Park
- College Park, MD
- http://twizwei.github.io/
Highlights
- Pro
Stars
Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model"
[ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"
Wan: Open and Advanced Large-Scale Video Generative Models
PE3R: Perception-Efficient 3D Reconstruction. Take 2 - 3 photos with your phone, upload them, wait a few minutes, and then start exploring your 3D world via text!
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
A suite of image and video neural tokenizers
[ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding.
super-ai: unified-vision, math-think/mathink; private
Setup PyTorch on Mac/Apple Silicon plus a few benchmarks.
AI assistant that can query visual datasets, search the FiftyOne docs, and answer general computer vision questions
Refine high-quality datasets and visual AI models
A unified framework for 3D content generation.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
PyTorch code and models for the DINOv2 self-supervised learning method.
Efficient distortion loss with O(n) realization.
[ICRA2023] Video Waterdrop Removal via Spatio-Temporal Fusion in Driving Scenes
Official PyTorch implementation of StyleGAN3
Navigating StyleGAN2 w latent space using CLIP
[CVPR 2021 Best Paper Award Candidate] PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation, (Oral, Best Paper Award Finalist)