Skip to content
View eweiguu's full-sized avatar

Block or report eweiguu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

📑 PageIndex: Document Index for Reasoning-based RAG

Jupyter Notebook 4,087 304 Updated Nov 20, 2025

A topic-centric list of HQ open datasets.

70,782 10,942 Updated Nov 5, 2025
Jupyter Notebook 99 11 Updated Dec 23, 2024

We write your reusable computer vision tools. 💜

Python 35,995 3,021 Updated Nov 24, 2025

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 7,818 640 Updated Nov 6, 2025

Main reference implementation for NLWeb, implemented in Python.

Python 6,073 676 Updated Nov 6, 2025

所有小初高、大学PDF教材。

Roff 59,136 13,193 Updated Oct 18, 2025

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 49,405 4,103 Updated Nov 21, 2025
Python 4 Updated Jul 4, 2023

AI PDF chatbot agent built with LangChain & LangGraph

TypeScript 16,188 3,221 Updated Feb 20, 2025

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 24,460 2,051 Updated Jul 29, 2025

Rubiks lets you define an OLAP schema then generate a Mondrian XML or JSON file.

Ruby 3 2 Updated Jun 5, 2013

A command line VNC client and python library

Python 494 131 Updated Nov 2, 2025

[ECCV 2020] Flow-edge Guided Video Completion

Python 1,555 261 Updated Dec 14, 2021

Text classification using Naive Bayes and Elasticsearch

Python 152 17 Updated Aug 2, 2016