Lists (1)
Sort Name ascending (A-Z)
Stars
📑 PageIndex: Document Index for Reasoning-based RAG
A topic-centric list of HQ open datasets.
We write your reusable computer vision tools. 💜
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
Main reference implementation for NLWeb, implemented in Python.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
AI PDF chatbot agent built with LangChain & LangGraph
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Rubiks lets you define an OLAP schema then generate a Mondrian XML or JSON file.
[ECCV 2020] Flow-edge Guided Video Completion
Text classification using Naive Bayes and Elasticsearch