Skip to content
View onetthree's full-sized avatar

Block or report onetthree

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.

Python 317 52 Updated Jul 31, 2025

A lexicon for Sudachi

Python 265 20 Updated Aug 29, 2025

A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search sc…

C++ 4,948 602 Updated Oct 30, 2025

Train emoji embeddings based on emoji descriptions.

Python 18 7 Updated Mar 31, 2018

emoji2vec: Learning Emoji Representations from their Description

Jupyter Notebook 269 51 Updated Aug 18, 2022

libiconv Windows build with Visual Studio.

C 113 45 Updated Jul 25, 2025

Seamless operability between C++11 and Python

C++ 17,413 2,234 Updated Nov 3, 2025

Google AI 2018 BERT pytorch implementation

Python 6,493 1,328 Updated Sep 15, 2023

General purpose C++ library for PFI

C++ 160 39 Updated Sep 13, 2021

Lightweight C++ command line option parser

C++ 4,602 626 Updated Nov 1, 2025

Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.

Python 1,486 365 Updated Dec 7, 2022

Sequence to Sequence Learning with Keras

Python 3,173 839 Updated Aug 20, 2022

Python version of Sudachi, a Japanese tokenizer.

Python 417 51 Updated Oct 7, 2022

形態素解析器性能評価システム MevAL

Java 7 2 Updated Aug 13, 2019

A C++11 library for serialization

C++ 4,542 813 Updated Jan 20, 2025

BlackOut and Adaptive Softmax for language models by Chainer

Python 11 3 Updated Oct 20, 2017

Various examples for Kotlin

3,216 1,032 Updated Jan 22, 2024

The tool to make NLP datasets ready to use

Python 242 32 Updated Oct 20, 2022

Fairy Morphological Annotated Corpus

Python 7 2 Updated Dec 14, 2017

A large parallel corpus of English and Japanese

Python 86 12 Updated Nov 1, 2017

A clone of Darts (Double-ARray Trie System)

C++ 154 48 Updated May 14, 2025

Yet Another Japanese Dependency Structure Analyzer

C++ 115 24 Updated Feb 22, 2025

An open-source NLP research library, built on PyTorch.

Python 11,881 2,242 Updated Nov 22, 2022

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 32,756 4,613 Updated Oct 28, 2025

A Japanese Tokenizer for Business

Java 906 73 Updated Jun 17, 2025

これまで研究室の勉強会などで使ってきた資料など

3 Updated Aug 27, 2019

Code accompanying our EMNLP paper Learning Language Representations for Typology Prediction

Python 71 5 Updated Aug 19, 2017

A Python library to use infix notation in Python

Python 2,080 117 Updated Mar 23, 2025

Code for SAILORS 2017 NLP project

Jupyter Notebook 35 26 Updated Jun 18, 2022
Next