-
bluesky-scrape Public
Scraping posts from BlueSky
Python GNU General Public License v3.0 UpdatedApr 27, 2025 -
Autoria Public
Authorship attribution: it uses the mean of 4 distance measures (perplexity, kullbackleibler, ranking, and cosine) to compute the distance between the text of an unknown author and the texts of kno…
Perl UpdatedMar 15, 2025 -
Datasets to train a sentiment analysis model for Galician language
Creative Commons Zero v1.0 Universal UpdatedFeb 16, 2025 -
galeXtra Public
Multiword Extractor for Portuguese, English, Spanish, Galician, French
-
-
reddit-scrape Public
A python script to scrape posts from Reddit with keywords, by using Reddit API credentials
-
mastodon-scrape Public
Scrapper of toots from Mastodon
-
Perplexity Public
Language Distance Measure
-
transcription-video Public
Speech transcription from Youtube and mp4 videos with Whisper
-
-
-
-
alignment2dict Public
It creates bilingual probabilistic dictionaries from aligned sentences (parallel corpus)
Perl GNU General Public License v3.0 UpdatedMar 29, 2022 -
DepFunc Public
Compositiona distributional semantics with syntactic dependencies
Perl GNU General Public License v3.0 UpdatedMar 16, 2022 -
LanguageModel Public
N-Gram Language Models
Perl GNU General Public License v3.0 UpdatedOct 20, 2021 -
-
-
topomedieval Public
Extrator automático de topónimos em corpus medievais galego-portugueses
Perl GNU General Public License v3.0 UpdatedApr 3, 2021 -
CrossLingual Public
Method to build transparent cross-lingual models from monolingual corpora
-
-
LanguageDistance Public
ALD: Average Language Distance
-
Discourse_Parser Public
Discourse parser based on DepPattern (dependency-base parsing)
-
-
-
DepPattern Public
Dependency Syntactic Parsing for Portuguese, Spanish, English, and Galician, including MetaRomance parser
-
depression_classification Public
Lexicon-based method to detect depressive language
-
-
Linguakit Public
Forked from citiususc/LinguakitMultilingual toolkit for NLP: dependency parser, PoS tagger, NERC, multiword extractor, sentiment analysis, etc.
Perl GNU General Public License v3.0 UpdatedSep 5, 2017 -
UD_Galician-TreeGal Public
Forked from UniversalDependencies/UD_Galician-TreeGalOther UpdatedFeb 25, 2017 -
CitiusSentiment Public
Sentiment analysis (opinion mining) for Portuguese, English, Spanish, and Galician