Starred repositories
RUCAIBox / CIKM2020-S3Rec
Forked from aHuiWang/CIKM2020-S3RecCode for CIKM2020 "S3-Rec: Self-Supervised Learning for Sequential Recommendation with Mutual Information Maximization"
【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models
Accessible large language models via k-bit quantization for PyTorch.
Convert PDF to markdown + JSON quickly with high accuracy
Implementation of Nougat Neural Optical Understanding for Academic Documents
【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling
An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.
This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.
A Comprehensive Toolkit for High-Quality PDF Content Extraction
A high-throughput and memory-efficient inference and serving engine for LLMs
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
Research Code for Multimodal-Cognition Team in Ant Group
DeepSeek-VL: Towards Real-World Vision-Language Understanding
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
Demos, examples and utilities using PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.