🐏
Margarita Francorum

Stuart is the Open Data Hub's LLM-based chatbot. It helps the Open Data Hub customer care team resolve tickets. Stuart uses as input: the Open Data Hub Wiki, the past tickets history and t…

Python 11 3 Updated Dec 19, 2025

This repository hosts materials from the CLiC-IT 2023 tutorial

Jupyter Notebook 30 2 Updated Jun 5, 2024

A repository illustrating the transformation of an ALTO XML file into XML-TEI format

XSLT 5 Updated Jun 30, 2022

Breviloquia Italica: data pipeline

Jupyter Notebook 2 Updated Feb 5, 2024

List of Computer Science courses with video lectures.

70,489 9,434 Updated Dec 14, 2025

All4Ling Citizen Science at Eurac Research

HTML 2 Updated Feb 10, 2025

Python tools for performing various operations on ALTO XML files

Python 48 17 Updated Feb 27, 2025

models

5 Updated Apr 6, 2020

Tutorial on NE processing for Digital Humanities - DH Utrecht 2019

Jupyter Notebook 25 4 Updated Jul 18, 2019

extract text from ALTO file

Python 9 6 Updated Sep 26, 2023

Named Entities Recognition Annotator Tool for Europeana Newspapers

Java 61 6 Updated Jan 12, 2018

Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.

HTML 35 9 Updated May 25, 2023

Named Entity Recognition data for Europeana Newspapers

173 31 Updated Apr 5, 2023
Jupyter Notebook 260 34 Updated Jul 7, 2025

QA-tool for scans with corresponding ALTO-files

Shell 26 6 Updated Dec 2, 2022

calculate OCR confidence per page in ALTO

Python 3 1 Updated Sep 26, 2023

ALTO XML schema - latest and all former versions

55 4 Updated Dec 5, 2025

Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA

Java 23 9 Updated Feb 11, 2022

Conversions between various OCR formats

82 3 Updated May 13, 2023

Flexible INtegrated Transformation and Annotation eNgineering platform

JavaScript 6 1 Updated Jun 28, 2022

A curated list of various semantic web and linked data resources.

1,586 259 Updated Jul 20, 2025

Named entity annotation tool

JavaScript 28 5 Updated Jul 6, 2023

Python-based tools for document analysis and OCR

Jupyter Notebook 3,467 596 Updated May 22, 2021

Edition Visualization Technology version 3

TypeScript 29 20 Updated Dec 1, 2025

OCR engine for all the languages

Python 926 155 Updated Dec 18, 2025

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

JavaScript 37,671 2,355 Updated Jan 1, 2026

A Mashup Interface for Text Analysis Operations

JavaScript 13 2 Updated Dec 23, 2024

Edition Visualization Technology 2 - development

JavaScript 80 19 Updated Dec 30, 2025

Transc&Anno is a transcription tool adapted for learner corpora and based on the open source software FromThePage (http://beta.fromthepage.com).

JavaScript 2 1 Updated Nov 16, 2022