Starred repositories
🦛 CHONK docs with Chonkie ✨ — The no-nonsense RAG library
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and open-source models.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
A high-throughput and memory-efficient inference and serving engine for LLMs
The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their performance across carefully curated Portuguese language ta…
Back up your device without vendor lock-in, insecure software, or root access. Supports encryption and compression out of the box. Works cross-platform.
Repository for documents and studies about closed-domain question answering with LLMs
💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, telegram
Deliver safe & effective language models
General technology for enabling AI capabilities w/ LLMs and MLLMs
HateBR is the first large-scale, expert-annotated dataset of Brazilian Instagram comments for hate speech and offensive language detection on the web and social media.
HAREM dataset preprocessing script with XML to JSON conversion.
Convert Machine Learning Code Between Frameworks
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
Portuguese translation of the GLUE benchmark and Scitail dataset
SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks
More interactive weak supervision with FlyingSquid
Hashtag segmentation with a simple BiLSTM RNN. [BSc thesis]
Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).
High performance distributed framework for training deep learning recommendation models based on PyTorch.
☁️ Build multimodal AI applications with cloud-native stack
LightXML: Transformer with dynamic negative sampling for High-Performance Extreme Multi-label Text Classification
IDEA plugin that changes the font size when a Retina display is detected