Wenhao Wu whwu95

♥️

I may be slow to respond.

Scientist @ Amazon | Ex-Ph.D. @ USYD

144 followers · 29 following

Amazon AGI
Bellevue, WA, US
whwu95.github.io
@dr_wenhao
in/wenhao-w-usyd

Stars

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python · 14,649 stars · 2,115 forks · Updated Jul 17, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python · 4,007 stars · 299 forks · Updated Nov 3, 2025

InternLM / Intern-S1

A Scientific Multimodal Foundation Model

604 stars · 29 forks · Updated Sep 30, 2025

hshjerry / VideoEspresso

[CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

Python · 125 stars · 4 forks · Updated Jul 28, 2025

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python · 3,464 stars · 295 forks · Updated Oct 29, 2025

HJYao00 / Awesome-Reasoning-MLLM

Awesome Reasoning in MLLMs: Papers and Projects about learning to reason with MLLMs, including Chain-of-Thought (CoT), OpenAI o1, and DeepSeek-R1

57 stars · 4 forks · Updated Mar 18, 2025

alexeyinkin / eb-1a

TeX · 92 stars · 44 forks · Updated Jan 29, 2025

HJYao00 / Mulberry

[NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS

Python · 1,224 stars · 110 forks · Updated Sep 19, 2025

swordlidev / Efficient-Multimodal-LLMs-Survey

Efficient Multimodal Large Language Models: A Survey

375 stars · 20 forks · Updated Apr 29, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook · 16,089 stars · 1,276 forks · Updated Oct 27, 2025

QwenLM / Qwen2.5-Math

A series of math-specific large language models of our Qwen2 series.

Python · 1,024 stars · 143 forks · Updated Jan 11, 2025

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python · 7,231 stars · 617 forks · Updated Nov 7, 2025

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python · 1,922 stars · 146 forks · Updated Apr 21, 2025

autogluon / autogluon-rag

Retrieval-Augmented Generation in 3 Lines of Code!

Python · 49 stars · 6 forks · Updated Feb 3, 2025

AudioLLMs / AudioBench

AudioBench: A Universal Benchmark for Audio Large Language Models

Python · 271 stars · 13 forks · Updated Jun 17, 2025

takomc / amp

【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"

Python · 20 stars · 1 fork · Updated Sep 26, 2024

HJYao00 / DenseConnector

【NeurIPS 2024】Dense Connector for MLLMs

Python · 180 stars · 8 forks · Updated Oct 14, 2024

whwu95 / FreeVA

FreeVA: Offline MLLM as Training-Free Video Assistant

Python · 64 stars · 1 fork · Updated Jun 9, 2024

RayeRen / acad-homepage.github.io

AcadHomepage: A Modern and Responsive Academic Personal Homepage

SCSS · 2,370 stars · 4,813 forks · Updated Nov 6, 2025

johnnyhwu / Awesome-LLM-Tabular

Awesome-LLM-Tabular: a curated list of Large Language Models applied to Tabular Data

417 stars · 32 forks · Updated Dec 22, 2024

whwu95 / GPT4Vis

GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?

Python · 186 stars · 18 forks · Updated May 22, 2024

whwu95 / ATM

【ICCV'2023】What Can Simple Arithmetic Operations Do for Temporal Modeling?

Python · 74 stars · 6 forks · Updated Jan 26, 2024

MisterBooo / LeetCodeAnimation

Demonstrate all the questions on LeetCode in the form of animation. (Presents the reasoning behind LeetCode solutions as animations.)

Java · 76,504 stars · 14,016 forks · Updated Aug 14, 2023

progfay / shields-with-icon

Enjoy https://shields.io

Go · 453 stars · 250 forks · Updated Nov 3, 2025

nerfies / nerfies.github.io

JavaScript · 3,683 stars · 1,577 forks · Updated Jun 21, 2024

mdyao / CSNorm

[ICCV 2023] Official Implementation of "Generalized Lightness Adaptation with Channel Selective Normalization"

Python · 85 stars · 6 forks · Updated Jan 22, 2024

mdyao / Awesome-3D-AIGC

A curated list of papers and open-source resources focused on 3D AIGC.

331 stars · 17 forks · Updated Sep 1, 2024

Ileriayo / markdown-badges

Badges for your personal developer branding, profile, and projects.

SCSS · 15,777 stars · 1,763 forks · Updated Nov 6, 2025

PKU-YuanGroup / Video-LLaVA

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python · 3,393 stars · 245 forks · Updated Dec 3, 2024

HJYao00 / Side4Video

Python · 42 stars · 3 forks · Updated Apr 7, 2024
