gebteus (status: 🎯 Focusing)

📰 Binary distribution of PDFium

Shell 1,199 234 Updated Nov 24, 2025

The best ChatGPT that $100 can buy.

Python 37,586 4,605 Updated Nov 17, 2025

Protocol Buffers for the rest of us

Python 205 12 Updated Nov 25, 2025

Morphological Analyser of Uzbek text based on affixes

Python 2 2 Updated Jan 18, 2025

SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, xDC replica…

Go 27,543 2,553 Updated Nov 26, 2025

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 40,417 3,631 Updated Nov 26, 2025

Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.

TypeScript 4,184 214 Updated Nov 25, 2025

Custom Russian tokenizer for spaCy

Python 44 6 Updated May 14, 2019

Uzbek Lemmatizer for Python

Python 2 Updated Apr 29, 2023

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Python 7,677 927 Updated Nov 26, 2025

Snowball compiler and stemming algorithms

C 816 190 Updated Nov 13, 2025

A neural word aligner based on multilingual BERT

Python 359 58 Updated Mar 10, 2022

An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For instance useful for comparing a translation with the original te…

Python 25 Updated Nov 27, 2021

Pacific Drive UEVR compatibility plugin

C++ 19 1 Updated Mar 2, 2024

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,299 1,944 Updated Nov 1, 2025

Renderer for the harmony response format to be used with gpt-oss

Rust 4,030 230 Updated Nov 5, 2025

State-of-the-art TTS model under 25MB 😻

Python 9,125 459 Updated Aug 23, 2025

SPT-VR brings the immersive, intense experience of Tarkov into the realm of virtual reality. Engage in intense firefights, loot dangerous environments, and survive the unforgiving world of Tarkov—a…

C# 55 7 Updated Nov 26, 2025

RuPAWS: A Russian Adversarial Dataset for Paraphrase Identification

1 Updated Jan 20, 2022

SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.

Python 845 92 Updated Oct 10, 2025

GPU cluster manager for optimized AI model deployment

Python 4,075 409 Updated Nov 26, 2025

Python3 bindings for the Compact Language Detector v3 (CLD3)

C++ 155 6 Updated Jun 26, 2023

Library for studying Cayley graphs and Schreier coset graphs

Python 409 23 Updated Nov 23, 2025

✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models

Python 21 2 Updated Oct 1, 2025

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Python 1,197 75 Updated Nov 19, 2025

Crysis. In VR.

C++ 221 9 Updated Feb 16, 2025

NLP, before and after spaCy

Python 2,234 249 Updated Sep 22, 2023

🧹 Python package for text cleaning

Python 996 80 Updated May 9, 2023

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Python 1,568 52 Updated Nov 21, 2025

A .NET library to access files and directories with more than 260 characters length.

C# 149 29 Updated Jun 11, 2023