Skip to content
View eli64s's full-sized avatar

Highlights

  • Pro

Block or report eli64s

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

♕ RAG

Retrieval-Augmented Generation (RAG) resources.
10 repositories

Adding guardrails to large language models.

Python 5,814 458 Updated Oct 10, 2025

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …

HTML 12,963 1,065 Updated Oct 15, 2025
Python 195 11 Updated May 5, 2024

DocLLM: A layout-aware generative language model for multimodal document understanding

129 6 Updated Jan 3, 2024

Get clean data from tricky documents, powered by vision-language models ⚡

Python 1,315 79 Updated Sep 21, 2025

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieval Results in RAG Systems (WWW 2025)

Python 447 36 Updated Jun 11, 2025
Python 897 111 Updated Oct 26, 2024

NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, con…

Python 2,751 269 Updated Oct 17, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 44,776 6,455 Updated Oct 18, 2025

A curated list of awesome synthetic data tools (open source and commercial).

214 29 Updated Jan 11, 2024