G2i AI Hub

A secure, extensible platform for AI services including document processing, agents, and more. Built with FastAPI to provide a scalable architecture for AI-powered features.

Features

Document Processing: Extract structured content from PDFs, Word documents, and more
AI Agents: (Coming Soon) Autonomous agents for specialized tasks
Processing Chains: (Coming Soon) Multi-step AI workflows
Knowledge Indexing: (Coming Soon) Lemma and concept indexing

Getting Started

Installation

# Clone the repository
git clone <repository-url>
cd api-proxy

# Install dependencies
pip install -r requirements.txt

# Set up environment variables
cp .env.example .env
# Edit .env to add your API_KEY

Running

# Run the development server
python main.py

Environment Configuration

Create a .env file with these variables:

# Required for API authentication
API_KEY=your_api_key_here

# Optional (defaults shown)
DOCLING_API_URL=http://docling-serve-cpu.railway.internal:3000
DOCLING_SERVICE_NAME=docling-serve-cpu
DOCLING_SERVICE_PORT=3000

API Reference

Base URL: https://ai.g2i.co/api/v1

Public Endpoints (No Authentication)

Health Check: GET /health - Verify the service is running
API Documentation: GET /docs - Interactive API documentation
Agents (Coming Soon): GET /agents - List available AI agents

Authenticated Endpoints (Bearer Token Required)

All document processing endpoints require authentication with:

Authorization: Bearer <api_token>

Document Processing

Convert URL: POST /document/convert/source - Process documents from URLs
Convert File: POST /document/convert/file - Process uploaded document files
Async Processing: POST /document/convert/source/async - Start async document conversion
Check Status: GET /document/status/poll/{task_id}?wait={seconds} - Poll for status updates
Get Results: GET /document/result/{task_id} - Retrieve conversion results

Usage Examples

Document Processing (Authenticated)

# Process a document from URL
curl -X 'POST' \
  'https://ai.g2i.co/api/v1/document/convert/source' \
  -H 'Content-Type: application/json' \
  -H 'Authorization: Bearer <api_token>' \
  -d '{
    "http_sources": [{"url": "https://arxiv.org/pdf/2206.01062.pdf"}],
    "options": {
      "to_formats": ["json"],
      "from_formats": ["pdf"],
      "image_export_mode": "embedded",
      "do_ocr": true
    }
  }' \
  --output 'output.zip'

Agents API (No Authentication)

# List available agents
curl -X 'GET' 'https://ai.g2i.co/api/v1/agents'

Document Processing Options

Option	Description	Default
`to_formats`	Output formats (json, md, html, text, doctags)	["md"]
`from_formats`	Input formats to process	["pdf"]
`image_export_mode`	How to handle images (embedded, placeholder, referenced)	"embedded"
`do_ocr`	Enable OCR for images	true
`ocr_engine`	OCR engine to use (easyocr, rapidocr, tesseract)	"easyocr"
`pdf_backend`	PDF parsing backend	"dlparse_v4"
`return_as_file`	Return as ZIP file instead of JSON	false

For a complete list of options, see the API documentation.

Project Structure

app/
├── api/              # API routes by version
├── core/             # Configuration and utilities
├── middleware/       # Authentication middleware
├── models/           # Data models/schemas
├── services/         # Service integrations
├── utils/            # Utility functions
└── app.py            # Application entry point

Deployment

The application uses Railway Nixpacks for deployment. Configuration is in railway.json.

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
app		app
.DS_Store		.DS_Store
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
CELERY_TROUBLESHOOTING.md		CELERY_TROUBLESHOOTING.md
CLAUDE.md		CLAUDE.md
LICENSE.md		LICENSE.md
README.md		README.md
main.py		main.py
pyproject.toml		pyproject.toml
railway-celery.json		railway-celery.json
railway.json		railway.json
requirements.txt		requirements.txt
run_celery_worker.sh		run_celery_worker.sh
screenshot.png		screenshot.png
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

G2i AI Hub

Features

Getting Started

Installation

Running

Environment Configuration

API Reference

Public Endpoints (No Authentication)

Authenticated Endpoints (Bearer Token Required)

Document Processing

Usage Examples

Document Processing (Authenticated)

Agents API (No Authentication)

Document Processing Options

Project Structure

Deployment

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

License

g2i/ai-hub

Folders and files

Latest commit

History

Repository files navigation

G2i AI Hub

Features

Getting Started

Installation

Running

Environment Configuration

API Reference

Public Endpoints (No Authentication)

Authenticated Endpoints (Bearer Token Required)

Document Processing

Usage Examples

Document Processing (Authenticated)

Agents API (No Authentication)

Document Processing Options

Project Structure

Deployment

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages