MingLunHan

🎯

Focusing

Rowanhart_bubu MingLunHan

🎯

Focusing

Research Scientist @ByteDance-Seed; Previously PhD @CASIA. I am interested in pre-training and post-training of omnimodal, multimodal, and audio large model.

70 followers · 108 following

Achievements

Lists (9)

Sort

🔮 Future ideas

✨ Inspiration

🚀 My stack

TN & ITN

1 repository

Starred repositories

AVoCaDO-Captioner / AVoCaDO

https://avocado-captioner.github.io/

Python 20 Updated Oct 16, 2025

inclusionAI / MingTok-Audio

Python 68 7 Updated Nov 12, 2025

inclusionAI / Ming-UniAudio

Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation

Python 393 28 Updated Nov 27, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,359 1,329 Updated Nov 20, 2025

LLaVA-VL / LLaVA-NeXT

Python 4,416 424 Updated Sep 14, 2025

Gar-b-age / CookLikeHOC

🥢像老乡鸡🐔那样做饭。主要部分于2024年完工，非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》，并做归纳、编辑与整理。CookLikeHOC.

JavaScript 22,313 2,255 Updated Oct 17, 2025

QwenLM / Qwen3-Omni

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 2,987 175 Updated Oct 9, 2025

laude-institute / terminal-bench

A benchmark for LLMs on complicated tasks in the terminal

Python 1,133 401 Updated Nov 24, 2025

ai-agents-2030 / awesome-deep-research-agent

509 44 Updated Sep 18, 2025

BladeDancer957 / RevisitingCSS

Python 11 Updated Aug 7, 2025

phellonchen / MindVL

MindVL: Towards Efficient and Effective Training of Multimodal Large Language Models on Ascend NPUs

2 Updated Sep 29, 2025

tengjuilin / markdown-resume

A simple, elegant, and fast workflow to write resumes and CVs in Markdown.

HTML 92 47 Updated Jan 24, 2025

FireRedTeam / FireRedChat

A Fully Self-Hosted Solution for Full-Duplex Voice Interaction

Python 428 28 Updated Sep 28, 2025

QuantaAlpha / GitTaskBench

Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with cost-aware α metric.

Python 237 15 Updated Sep 22, 2025

invictus717 / MiCo

[ICCV 2025] Explore the Limits of Omni-modal Pretraining at Scale

Python 121 6 Updated Sep 2, 2024

ByteDance-Seed / seed-oss

Python 842 45 Updated Sep 15, 2025

xiaomi-research / r1-aqa

🤗 R1-AQA Model: mispeech/r1-aqa

Python 306 26 Updated Mar 28, 2025

DmitryRyumin / ICML-2025-Papers

ICML 2025 Papers: Dive into cutting-edge research from the premier machine learning conference. Stay current with breakthroughs in deep learning, generative AI, optimization, reinforcement learning…

20 Updated Oct 24, 2025

HW-whistleblower / True-Story-of-Pangu

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,379 1,358 Updated Jul 9, 2025

YSGStudyHards / Awesome-Tools

🛠Awesome Tools，程序员常用高效实用工具、软件资源精选，办公效率提升利器（A Curated Collection of High-Efficiency and Practical Tools and Software Resources for Programmers to Boost Office Productivity）。

833 106 Updated Nov 20, 2025