Stars
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL 2025.
Context retrieval for AI agents across apps and databases
The AI Browser Automation Framework
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your agentic workflows.
The UI and agentic backend of the fred innovation track
Open-Source Memory Engine for LLMs, AI Agents & Multi-Agent Systems
AI conversations that actually remember. Never re-explain your project to your AI again. Join our Discord: https://discord.gg/tyvKNccgqN
Fractalic: Build and version-control AI systems using Markdown & YAML. Combine LLM calls, shell commands, and modular workflows in a human-readable format. Docker-first installation, Git-native tra…
🌌 A React toolkit for graph visualization based on G6.
Quill is a modern WYSIWYG editor built for compatibility and extensibility
Chat2Graph: Graph Native Agentic System.
GeoAI: Artificial Intelligence for Geospatial Data
A Python library to extract tabular data from PDFs
Create and modify Word documents with Python
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
Modern observability platform: 10x easier, 140x lower storage cost, petabyte scale. Open-source alternative to Elasticsearch/Splunk/Datadog for logs, metrics, traces, RUM, and more.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Community-maintained fork of pdfminer - we fathom PDF
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differen…
Uwazi is a web-based, open-source solution for building and sharing document collections
Knowledge Agents and Management in the Cloud
This repo provides the server-side code that the llmsherpa API connects to. It includes parsers for various file formats.