ocrbase

Turn PDFs into structured data at scale. Powered by frontier open-weight OCR models with a type-safe TypeScript SDK.

Features

Best-in-class OCR - PaddleOCR-VL-0.9B for accurate text extraction
Structured extraction - Define schemas, get JSON back
Built for scale - Queue-based processing for thousands of documents
Type-safe SDK - Full TypeScript support with React hooks
Real-time updates - WebSocket notifications for job progress (client-side)
Self-hostable - Run on your own infrastructure

Quick Start

npm install ocrbase

# .env
OCRBASE_API_KEY=sk_xxx

Important: Jobs are processed asynchronously. Use WebSocket for real-time status updates.

import { createClient } from "ocrbase";

const { parse, extract, ws } = createClient({
  apiKey: process.env.OCRBASE_API_KEY,
});

// Start parsing (returns job with status "pending")
const job = await parse({ file: document });

// Subscribe to real-time updates via WebSocket
ws.subscribeToJob(job.id, {
  onComplete: (result) => {
    console.log(result.markdownResult); // string - the parsed markdown
  },
});

React Integration

Use the useJobSubscription hook for real-time updates:

import { useParse, useJobSubscription } from "ocrbase/react";

function DocumentParser() {
  const [jobId, setJobId] = useState<string | null>(null);
  const parse = useParse();

  const handleFile = (file: File) => {
    parse.mutate({ file }, { onSuccess: (job) => setJobId(job.id) });
  };

  const { status, job } = useJobSubscription(jobId!, { enabled: !!jobId });

  if (status === "completed" && job) {
    return <pre>{job.markdownResult}</pre>;
  }

  return (
    <div>
      <input type="file" onChange={(e) => handleFile(e.target.files![0])} />
      {status && <p>Status: {status}</p>}
    </div>
  );
}

LLM Integration

Best practice: Parse documents with ocrbase before sending to LLMs. Raw PDF binary wastes tokens and produces poor results.

Pattern: Parse on client with WebSocket → send markdown to your API → call LLM.

See SDK documentation for React hooks and advanced usage.

Self-Hosting

See Self-Hosting Guide for deployment instructions.

Requirements: Docker, Bun

Architecture

License

MIT - See LICENSE for details.

Contact

For API access, on-premise deployment, or questions: adammajcher20@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 66 Commits
.claude		.claude
.github/workflows		.github/workflows
.vscode		.vscode
apps		apps
docker/paddleocr		docker/paddleocr
docs		docs
examples		examples
packages		packages
spec		spec
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.oxfmtrc.jsonc		.oxfmtrc.jsonc
.oxlintrc.json		.oxlintrc.json
LICENSE		LICENSE
README.md		README.md
bun.lock		bun.lock
docker-compose.prod.yml		docker-compose.prod.yml
docker-compose.yml		docker-compose.yml
lefthook.yml		lefthook.yml
package.json		package.json
tsconfig.json		tsconfig.json
turbo.json		turbo.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ocrbase

Features

Quick Start

React Integration

LLM Integration

Self-Hosting

Architecture

License

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

lizzies/ocrbase

Folders and files

Latest commit

History

Repository files navigation

ocrbase

Features

Quick Start

React Integration

LLM Integration

Self-Hosting

Architecture

License

Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages