Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 2,600 136 Updated Oct 9, 2025

huggingface / tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 10,127 977 Updated Oct 8, 2025

wang2226 / Awesome-LLM-Decoding

📜 Paper list on decoding methods for LLMs and LVLMs

60 1 Updated Jun 30, 2025

SalesforceAIResearch / PretrainRL-pipeline

An automated data pipeline scaling RL to pretraining levels

Python 43 6 Updated Oct 11, 2025

PALIN2018 / BrowseComp-ZH

Python 114 5 Updated May 14, 2025

zhoushengisnoob / DeepClustering

Methods and Implements of Deep Clustering

3,018 425 Updated Aug 25, 2024

GasolSun36 / GRACE

Official repo for paper: "GRACE: Generative Representation Learning via Contrastive Policy Optimization"

Python 10 1 Updated Oct 4, 2025

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 23,720 2,643 Updated Aug 12, 2024

mlfoundations / open_clip

An open source implementation of CLIP.

Python 12,743 1,171 Updated Sep 21, 2025

Computer-Vision-in-the-Wild / CVinW_Readings

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

1,338 58 Updated Mar 14, 2024

fannie1208 / W4S

[COLM2025] "Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors"

Python 35 1 Updated Oct 6, 2025

inclusionAI / ASearcher

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 454 27 Updated Oct 8, 2025

yfzhang114 / Awesome-Multimodal-Large-Language-Models

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

665 25 Updated Sep 13, 2025

TIGER-AI-Lab / verl-tool

A version of verl to support diverse tool use

Python 591 43 Updated Oct 12, 2025

GAIR-NLP / LIMI

LIMI: Less is More for Agency

Python 138 7 Updated Oct 8, 2025

Parallel-Reasoning / APR

[COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models

Python 132 10 Updated Aug 15, 2025

agent-infra / sandbox

All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.

Python 591 56 Updated Oct 10, 2025

svg-project / flash-kmeans

Fast and memory-efficient exact kmeans

Python 103 6 Updated Sep 30, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 24,990 1,741 Updated Oct 13, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 14,165 1,075 Updated Oct 13, 2025

VectorSpaceLab / Infomatica

Data Synthesis for Deep Research Based on Semi-Structured Data

Python 169 13 Updated Oct 9, 2025

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 5,649 306 Updated Sep 30, 2025

OpenDCAI / DataFlow

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 1,377 91 Updated Oct 13, 2025

MiroMindAI / MiroThinker

MiroThinker is open-source agentic models trained for deep research and complex tool use scenarios.

Python 439 38 Updated Oct 2, 2025

Wei-Jie Xu vejaxu

Lists (14)

Awesome

Benchmark

Clustering

Deep Search

Dimension Reduction

Efficient Reasoning

LLM Compression

LLM Papers

LLM Serving

LLM Training

PyTorch

RL Framework

Tools

Transformers

Stars