SilentView

Tianwei Xiong SilentView

I am a HKU EEE Year 2 PhD Student from HKU-MMLab

20 followers · 16 following

Achievements

Highlights

Stars

shengyp / doing_the_PhD

2,256 274 Updated Oct 7, 2025

baaivision / Emu3.5

Native Multimodal Models are World Learners

Python 1,295 46 Updated Nov 28, 2025

HKU-MMLab / OmniX

Official implementation of "OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes".

Python 80 2 Updated Nov 3, 2025

NVlabs / RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 1,403 51 Updated Nov 27, 2025

aakaran / reasoning-with-sampling

Python 333 43 Updated Nov 7, 2025

thu-ml / RDT2

Official code of RDT 2

Python 590 26 Updated Oct 11, 2025

WayneJin0918 / SRUM

About Official repo of paper "SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models". A post-training framework that creates a cost-effective, self-iterative optimization loop.

Python 83 4 Updated Nov 26, 2025

HKU-MMLab / Math-VR-CodePlot-CoT

Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images

Python 41 3 Updated Nov 4, 2025

HKU-MMLab / OmniPart

[SIGGRAPH Asia 2025] OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Python 150 8 Updated Nov 6, 2025

facebookresearch / dinov3

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,590 615 Updated Nov 20, 2025

ZhengrongYue / UniFlow

Official Implementation of "UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation"

Jupyter Notebook 124 2 Updated Oct 17, 2025

guolinke / SphereAR

Implementation of "Hyperspherical Latents Improve Continuous-Token Autoregressive"

Python 80 6 Updated Nov 15, 2025

dc-ai-projects / DC-VideoGen

DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder

167 7 Updated Oct 5, 2025

alibaba-damo-academy / Lumos

Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.

Python 144 3 Updated Jul 17, 2025

FoundationVision / OmniTokenizer

[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 317 8 Updated Jul 9, 2024

WangRongsheng / awesome-LLM-resources

🧑‍🚀 全世界最好的LLM资料总结（语音视频生成、Agent、辅助编程、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型） | Summary of the world's best LLM resources.

6,839 650 Updated Nov 29, 2025

ssundaram21 / dreamsim

DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / / / / When Does Perceptual Alignment Benefit Vision Representations? (NeurIPS 2024)

Python 555 31 Updated Nov 24, 2025