RealChar Public
Forked from Shaunwei/RealChar
🎙️🤖 Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI …
Swift MIT License Updated Jul 19, 2023 -
CTranslate2 Public
Forked from OpenNMT/CTranslate2
Fast inference engine for OpenNMT models
C++ MIT License Updated May 7, 2021 -
MPNet Public
Forked from microsoft/MPNet
MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf
Python MIT License Updated Sep 26, 2020 -
ake-datasets Public
Forked from boudinfl/ake-datasets
Large, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.
Shell Apache License 2.0 Updated Jul 3, 2020 -
sentence-transformers Public
Forked from huggingface/sentence-transformers
Sentence Embeddings with BERT & XLNet
Python Apache License 2.0 Updated Jan 29, 2020 -
dastrie Public
Forked from chokkan/dastrie
Static Double Array Trie (DASTrie)
-
korean-sentence-splitter Public
Forked from likejazz/korean-sentence-splitter
Split Korean text into sentences using heuristic algorithm.
C++ BSD 3-Clause "New" or "Revised" License Updated Sep 11, 2019 -
NER Public
Forked from kmounlp/NER
Technical report on standardizing the definition and annotation of Korean named entities, and a named-entity morpheme corpus built on it
Updated May 21, 2019 -
KenLM: Faster and Smaller Language Model Queries
C++ Other Updated Apr 11, 2019 -
Ivory Public
Forked from lintool/Ivory
A Hadoop toolkit for web-scale information retrieval research
Java Updated Nov 29, 2018 -
OpenNMT-py Public
Forked from OpenNMT/OpenNMT-py
Open-Source Neural Machine Translation in PyTorch http://opennmt.net/
Python Other Updated Jul 12, 2017 -
text Public
Forked from pytorch/text
Python BSD 3-Clause "New" or "Revised" License Updated May 17, 2017 -
subword-nmt Public
Forked from rsennrich/subword-nmt
Subword Neural Machine Translation
Python MIT License Updated Feb 27, 2017 -
opensubtitles-parser Public
Forked from domerin0/opensubtitles-parser
download, extract, parse and tokenize the opensubtitles dataset with this script
Python MIT License Updated Jan 3, 2017 -
extractor-wiki-data Public
extracting multiple-language data from wiki-data
Python Updated Jul 5, 2016 -