Skip to content
View MingLunHan's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report MingLunHan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

https://avocado-captioner.github.io/

Python 20 Updated Oct 16, 2025
Python 68 7 Updated Nov 12, 2025

Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation

Python 393 28 Updated Nov 27, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,359 1,329 Updated Nov 20, 2025
Python 4,416 424 Updated Sep 14, 2025

🥢像老乡鸡🐔那样做饭。主要部分于2024年完工,非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》,并做归纳、编辑与整理。CookLikeHOC.

JavaScript 22,313 2,255 Updated Oct 17, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 2,987 175 Updated Oct 9, 2025

A benchmark for LLMs on complicated tasks in the terminal

Python 1,133 401 Updated Nov 24, 2025
Python 11 Updated Aug 7, 2025

MindVL: Towards Efficient and Effective Training of Multimodal Large Language Models on Ascend NPUs

2 Updated Sep 29, 2025

A simple, elegant, and fast workflow to write resumes and CVs in Markdown.

HTML 92 47 Updated Jan 24, 2025

A Fully Self-Hosted Solution for Full-Duplex Voice Interaction

Python 428 28 Updated Sep 28, 2025

Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with cost-aware α metric.

Python 237 15 Updated Sep 22, 2025

[ICCV 2025] Explore the Limits of Omni-modal Pretraining at Scale

Python 121 6 Updated Sep 2, 2024
Python 842 45 Updated Sep 15, 2025

🤗 R1-AQA Model: mispeech/r1-aqa

Python 306 26 Updated Mar 28, 2025

ICML 2025 Papers: Dive into cutting-edge research from the premier machine learning conference. Stay current with breakthroughs in deep learning, generative AI, optimization, reinforcement learning…

20 Updated Oct 24, 2025

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,379 1,358 Updated Jul 9, 2025

🛠Awesome Tools,程序员常用高效实用工具、软件资源精选,办公效率提升利器(A Curated Collection of High-Efficiency and Practical Tools and Software Resources for Programmers to Boost Office Productivity)。

833 106 Updated Nov 20, 2025
Python 699 12 Updated Nov 20, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 11,041 1,121 Updated Apr 30, 2025

Latest Advances on System-2 Reasoning

Python 1,277 73 Updated Jun 8, 2025

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Python 209 9 Updated Sep 26, 2025

The first Large Audio Language Model that enables native in-depth thinking, which is trained on large-scale audio Chain-of-Thought data.

Python 268 24 Updated May 15, 2025

Visual R1: Trasfer Reasoning Ability from R1 to Visual R1

3 Updated Feb 15, 2025
Python 37 2 Updated Aug 26, 2025
Python 4,561 366 Updated Jun 12, 2025

Latest Advances on Reasoning of Multimodal Large Language Models (Multimodal R1 \ Visual R1) ) 🍓

35 Updated Apr 3, 2025
Next