hyunlin's starred repositories

Python 1 Updated Dec 27, 2021

yolo3+ocr

Python 6,109 1,722 Updated Aug 29, 2022

Mathematical derivation and pure Python code implementation of machine learning algorithms.

Jupyter Notebook 1,543 592 Updated Sep 18, 2024

A pytorch re-implementation of Real-time Scene Text Detection with Differentiable Binarization

Python 1,007 257 Updated Dec 29, 2022

PyTorch-based OCR algorithm library, including psenet, pan, dbnet, sast, crnn

C++ 682 131 Updated May 19, 2021

CTPN + DenseNet + CTC based end-to-end Chinese OCR implemented using tensorflow and keras

Python 2,786 1,073 Updated Oct 8, 2019

Generate text images for training deep learning ocr model

Python 1,455 388 Updated Jan 17, 2022

Ultra-lightweight Chinese OCR; supports vertical text recognition and ncnn, mnn, tnn inference (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); total model size only 4.7M

C++ 12,240 2,292 Updated Aug 14, 2023

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

Python 15,321 3,606 Updated Nov 29, 2025

table structure recognition

Python 275 93 Updated Nov 22, 2022

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)

5,951 995 Updated Feb 15, 2023

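Lianjia online data for second-hand home sales and rentals, data from the existing-home transaction service platform, and a detailed data analysis tutorial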

Python 1 Updated Aug 15, 2019

KenLM: Faster and Smaller Language Model Queries

C++ 2,697 533 Updated Mar 30, 2025

Faster and more effective Chinese new word discovery

Python 514 102 Updated Mar 15, 2024

Data augmentation for NLP

Jupyter Notebook 4,631 474 Updated Jun 24, 2024

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,324 2,130 Updated Oct 27, 2025

Unsupervised Data Augmentation (UDA)

Python 2,202 313 Updated Aug 28, 2021

Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus

13 3 Updated Feb 17, 2019

Language Identification and transliteration tool for Indian language code mixed data.

Python 23 14 Updated Feb 29, 2016

Neural Machine Translation with Attention (PyTorch)

Jupyter Notebook 44 15 Updated Nov 13, 2018

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Python 6,974 2,260 Updated Oct 14, 2025

Includes text classifier, language model, pre_trained model, multi_label classifier, text generator, dialogue, etc.

Python 472 221 Updated Dec 24, 2019

Minimal Seq2Seq model with Attention for Neural Machine Translation in PyTorch

Python 701 168 Updated Dec 13, 2020

An implementation of attention-based neural machine translation using Pytorch

Python 49 9 Updated Dec 19, 2018

The project aims to add a state-of-the-art transliteration module for cross-transliteration among all Indian languages, including English.

Python 272 62 Updated Oct 28, 2022

Xlit-Crowd: Hindi-English Transliteration Corpus

38 39 Updated Feb 17, 2015

Tutorial on English to Hindi Transliteration using Seq2Seq Architecture in Tensorflow

Jupyter Notebook 16 10 Updated Aug 28, 2019

A simple tool to convert Roman script to Indic (Devanagari) script. Most keyboards are English, so writing in Indic script is difficult. It is easy to write Hindi in Roman script this give…

C 13 1 Updated Aug 31, 2016

An unsupervised stemmer for Natural Language Processing Tasks on Hinglish Language ( Hindi + English words )

Jupyter Notebook 8 Updated Dec 6, 2019

A Hindi-English Dataset for Text Normalization

Python 17 5 Updated Jan 3, 2022