Skip to content
View julowe's full-sized avatar

Highlights

  • Pro

Block or report julowe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

#1 Locally hosted web application that allows you to perform various operations on PDF files

Java 69,437 5,868 Updated Nov 3, 2025

Jobs scraper library for LinkedIn, Indeed, Glassdoor, Google, ZipRecruiter & more

Python 2,303 485 Updated Aug 23, 2025

Readest is a modern, feature-rich ebook reader designed for avid readers offering seamless cross-platform access, powerful tools, and an intuitive interface to elevate your reading experience.

TypeScript 14,445 763 Updated Nov 2, 2025

A cross platform desktop reading app, based on the Readium Desktop toolkit

TypeScript 2,431 200 Updated Nov 1, 2025

A free self-hostable speed reader. Highly customizable. Implements chunking (RSVP), pacing and highlighting. Modern UI and local-storage only.

HTML 248 12 Updated Nov 10, 2024

Master programming by recreating your favorite technologies from scratch.

Markdown 432,837 40,640 Updated Oct 10, 2025

Mapping photos of Old New York

Python 293 130 Updated Dec 1, 2024

Tool for extracting important terms from a PDF and generating a printable index.

Python 8 3 Updated Sep 27, 2023

Collection of OCR-related python tools and wrappers from @OCR-D

Python 131 33 Updated Oct 15, 2025

Find the most popular fork on GitHub

TypeScript 130 7 Updated Nov 3, 2025

The Quranic Arabic Corpus, an invaluable linguistic resource, is due for a revamp. We're calling on Linguistics, AI, and Tech volunteers to join us in this exciting journey. ๐Ÿš€

TypeScript 1 Updated Jul 7, 2025

The Quranic Arabic Corpus, an invaluable linguistic resource, is due for a revamp. We're calling on Linguistics, AI, and Tech volunteers to join us in this exciting journey. ๐Ÿš€

TypeScript 108 15 Updated Jul 21, 2023
Python 51 7 Updated Apr 18, 2022

A chrome/firefox extension that download books from Internet Archive(archive.org) and HathiTrust Digital Library (hathitrust.org)

JavaScript 900 67 Updated Jul 19, 2025

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python 31,650 2,198 Updated Oct 27, 2025

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

Python 2,768 194 Updated Feb 27, 2025

A post-processing tool for scanned sheets of paper.

C 1,122 91 Updated Jul 11, 2024

Get your documents ready for gen AI

Python 42,873 3,065 Updated Oct 31, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 15,703 1,189 Updated Oct 31, 2025

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 62,388 9,211 Updated Oct 31, 2025

OCR engine for all the languages

Python 904 152 Updated Nov 2, 2025

Full text, footnotes, and formatting of the ASV Bible (1901).

35 8 Updated Jul 1, 2021

Explore machine learning and data science with Codespaces

Jupyter Notebook 753 1,538 Updated Sep 5, 2025

A Github template for writing LaTeX documents collaboratively with automatic rendering using Github actions.

TeX 24 7 Updated Jan 17, 2023

:octocat: GitHub Action to compile LaTeX documents

Shell 1,299 143 Updated Jul 9, 2025

Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.

JavaScript 665 36 Updated Oct 28, 2025

Working with hOCR in Javascript

HTML 136 19 Updated Mar 4, 2023

Web based JavaScript GUI library for proofreading/editing hOCR

JavaScript 100 27 Updated Sep 17, 2018

A web-based hOCR editor with visual overlay editing and intelligent OCR processing optimized for handwritten text.

Go 2 Updated Oct 15, 2025

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 28,299 3,494 Updated Sep 24, 2024
Next