Skip to content
View whwu95's full-sized avatar
♥️
I may be slow to respond.
♥️
I may be slow to respond.

Highlights

  • Pro

Block or report whwu95

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,649 2,115 Updated Jul 17, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,007 299 Updated Nov 3, 2025

A Scientific Multimodal Foundation Model

604 29 Updated Sep 30, 2025

[CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

Python 125 4 Updated Jul 28, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,464 295 Updated Oct 29, 2025

Awesome Reasoning in MLLMs: Papers and Projects about learning to reason with MLLMs, including Chain-of-Thought (CoT), OpenAl o1, and DeepSeek-R1

57 4 Updated Mar 18, 2025
TeX 92 44 Updated Jan 29, 2025

[NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS

Python 1,224 110 Updated Sep 19, 2025

Efficient Multimodal Large Language Models: A Survey

375 20 Updated Apr 29, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,089 1,276 Updated Oct 27, 2025

A series of math-specific large language models of our Qwen2 series.

Python 1,024 143 Updated Jan 11, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,231 617 Updated Nov 7, 2025

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,922 146 Updated Apr 21, 2025

Retrieval-Augmented Generation in 3 Lines of Code!

Python 49 6 Updated Feb 3, 2025

AudioBench: A Universal Benchmark for Audio Large Language Models

Python 271 13 Updated Jun 17, 2025

【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"

Python 20 1 Updated Sep 26, 2024

【NeurIPS 2024】Dense Connector for MLLMs

Python 180 8 Updated Oct 14, 2024

FreeVA: Offline MLLM as Training-Free Video Assistant

Python 64 1 Updated Jun 9, 2024

AcadHomepage: A Modern and Responsive Academic Personal Homepage

SCSS 2,370 4,813 Updated Nov 6, 2025

Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data

417 32 Updated Dec 22, 2024

GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?

Python 186 18 Updated May 22, 2024

【ICCV'2023】What Can Simple Arithmetic Operations Do for Temporal Modeling?

Python 74 6 Updated Jan 26, 2024

Demonstrate all the questions on LeetCode in the form of animation.(用动画的形式呈现解LeetCode题目的思路)

Java 76,504 14,016 Updated Aug 14, 2023

Enjoy https://shields.io

Go 453 250 Updated Nov 3, 2025
JavaScript 3,683 1,577 Updated Jun 21, 2024

[ICCV 2023] Official Implementation of "Generalized Lightness Adaptation with Channel Selective Normalization"

Python 85 6 Updated Jan 22, 2024

A curated list of papers and open-source resources focused on 3D AIGC.

331 17 Updated Sep 1, 2024

Badges for your personal developer branding, profile, and projects.

SCSS 15,777 1,763 Updated Nov 6, 2025

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 3,393 245 Updated Dec 3, 2024
Python 42 3 Updated Apr 7, 2024
Next