[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,093 391 Updated Jul 11, 2024

Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.

Jupyter Notebook 133 24 Updated Oct 18, 2025
Python 80 21 Updated Jun 12, 2023

Publicly released code for the LAMBERT model

Python 103 15 Updated Jun 14, 2021

TensorFlow implementation of Neural Variational Inference for Text Processing

Python 536 76 Updated Aug 10, 2016

Companion repository to GANs in Action: Deep learning with Generative Adversarial Networks

Jupyter Notebook 1,026 421 Updated Jul 23, 2025

Implementation of Graph Auto-Encoders in TensorFlow

Python 1,712 351 Updated Jan 3, 2020

A Java library for generating strings from a regular expression.

Java 382 75 Updated Apr 28, 2021

NeuSpell: A Neural Spelling Correction Toolkit

Python 696 102 Updated Jul 31, 2023

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,308 2,130 Updated Aug 18, 2025

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.

Python 1,552 248 Updated Jun 12, 2025

This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"

Python 1,549 431 Updated Aug 27, 2021

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)

Python 286 40 Updated Feb 13, 2023

A PyTorch implementation of scalable neural networks.

Python 23 6 Updated Jun 9, 2020

Google Research

Jupyter Notebook 36,596 8,221 Updated Oct 15, 2025

Joint Extraction of Entities and Relations Based on CNN+RNN

Python 183 53 Updated Jun 1, 2021

PyTorch implementation of the Google Research paper "Representation Learning for Information Extraction from Form-like Documents".

Python 99 29 Updated Nov 26, 2022

Deep neural network to extract intelligent information from invoice documents.

Python 2,653 411 Updated May 3, 2024

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 151,571 30,932 Updated Oct 24, 2025

Multi-Content GAN for Few-Shot Font Style Transfer at CVPR 2018

Python 447 124 Updated Jan 16, 2019

DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation

C++ 128 25 Updated Nov 29, 2023

Document Layout Analysis resources repos for development with PdfPig.

C# 625 68 Updated Oct 1, 2023

Repository of visual pre-training foundation models

Python 501 92 Updated Apr 12, 2023

ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction

Python 401 132 Updated Jul 20, 2020

State-of-the-Art Text Embeddings

Python 17,753 2,703 Updated Oct 22, 2025

PyTorch implementation of SwAV https://arxiv.org/abs/2006.09882

Python 2,075 285 Updated Apr 13, 2023

LayoutGAN-Tensorflow

Python 57 15 Updated Sep 11, 2020

Keras implementations of Generative Adversarial Networks.

Python 9,237 3,110 Updated Dec 12, 2022

AutoML for clustering models in sklearn.

Python 61 21 Updated Jan 9, 2023