Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Python 28,455 3,504 Updated Sep 24, 2024

The open-source, cross-platform API client for GraphQL, REST, WebSockets, SSE and gRPC. With Cloud, Local and Git storage.

TypeScript 37,600 2,183 Updated Nov 27, 2025

🧌 Parsing structured information from OCR outputs

Jupyter Notebook 20 1 Updated Dec 12, 2023

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

Python 1,789 198 Updated Nov 27, 2025

Rapid fuzzy string matching in Python using various string metrics

Python 3,543 144 Updated Nov 24, 2025

CodeVisualizer is a powerful VS Code extension that provides two main visualization capabilities: function-level flowcharts for understanding code control flow, and codebase-level dependency graphs…

TypeScript 375 32 Updated Nov 15, 2025

Toonify: Compact data format reducing LLM token usage by 30-60%

Python 247 15 Updated Nov 26, 2025

Tesseract Open Source OCR Engine (main repository)

C++ 71,115 10,400 Updated Oct 13, 2025

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 8,563 674 Updated Nov 27, 2025

AI Agents & MCPs & AI Workflow Automation • (~400 MCP servers for AI agents) • AI Automation / AI Agent with MCPs • AI Workflows & AI Agents • MCPs for AI Agents

TypeScript 19,308 2,968 Updated Nov 27, 2025

🎨 Ready-to-use DeepSeek-OCR Web UI | Modern Interface | 7 Recognition Modes | Batch Processing | Real-time Logging | Fully Responsive

HTML 229 51 Updated Nov 5, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 95,418 26,027 Updated Nov 27, 2025

The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.

Python 94,702 10,690 Updated Nov 27, 2025

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

Python 9,471 1,182 Updated Nov 24, 2025

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

C# 3,334 309 Updated Nov 5, 2025

Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/

Python 766 162 Updated Nov 26, 2025

Port of Google's language-detection library to Python.

Python 1,856 210 Updated Mar 3, 2025

LM Studio Python SDK

Python 706 108 Updated Oct 27, 2025

An interpretable regression model in Python with Random-Forest-level accuracy

Jupyter Notebook 6 Updated Nov 25, 2025

ContextGem: Effortless LLM extraction from documents

Python 1,728 138 Updated Nov 16, 2025

Boto3, an AWS SDK for Python

Python 9,607 1,936 Updated Nov 26, 2025

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 65,136 9,410 Updated Nov 27, 2025
Python 315 18 Updated Nov 5, 2025

ripgrep recursively searches directories for a regex pattern while respecting your gitignore

Rust 57,652 2,318 Updated Oct 30, 2025

The best ChatGPT that $100 can buy.

Python 37,652 4,615 Updated Nov 17, 2025

A developer-friendly API for converting numerous document formats into PDF files, and more!

Go 10,520 705 Updated Nov 24, 2025

Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes

Python 650 28 Updated Nov 26, 2025

Apache Lucene open-source search software

Java 3,256 1,253 Updated Nov 26, 2025

Free and Open Source, Distributed, RESTful Search Engine

Java 75,535 25,642 Updated Nov 27, 2025