NLP
💫 Industrial-strength Natural Language Processing (NLP) in Python
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
Robust Speech Recognition via Large-Scale Weak Supervision
🦜🔗 Build context-aware reasoning applications
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.