Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 63,703 9,329 Updated Nov 10, 2025

XinyuYanTJU / LawDIS

[ICCV'2025] LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation

Python 42 1 Updated Aug 5, 2025

rougier / scientific-visualization-book

An open access book on scientific visualization using python and matplotlib

Python 11,106 1,015 Updated Jan 22, 2024

open-compass / VLMEvalKit

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,362 548 Updated Nov 8, 2025

CellRecog / cellRecog

Trichomonas Vaginalis Segmentation in Microscope Images, MICCAI2022

Python 2 1 Updated Jul 10, 2022

ai4colonoscopy / IntelliScope

Frontiers in Intelligent Colonoscopy [ColonSurvey | ColonINST | ColonGPT]

Python 92 7 Updated Oct 9, 2025

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,472 31,127 Updated Nov 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jancsi9981

Highlights

Block or report Jancsi9981

Stars

zjunlp / KnowledgeEditingPapers

hiyouga / EasyR1

om-ai-lab / VLM-R1

StarsfieldAI / R1-V

open-mmlab / mmpose

cocodataset / cocoapi

fscdc / Awesome-Efficient-Reasoning-Models

HiLab-git / SSL4MIS

mlabonne / llm-course

zli12321 / Vision-Language-Models-Overview

jingyi0000 / VLM_survey

sing-group / deep-learning-colonoscopy

swordlidev / Evaluation-Multimodal-LLMs-Survey