RFI Vector Search System

Completing RFI's are extremely boring. No one likes to do them. This system is designed to help you complete RFI's faster and more efficiently. You can then focus on half finishing cool projects and playing with new technologies.

A local RAG (Retrieval-Augmented Generation) system for searching through RFI documents, policies, SOPs, and other technical documentation using Qdrant vector database.

If it works, please reach out and thank me. If it doesn't, well it worked on my machine.

Features

Local Embeddings: Uses SentenceTransformers (all-MiniLM-L6-v2) for semantic search
GPU Acceleration: Optimized batch processing with CUDA support when available
Multiple File Formats: Supports PDF, DOCX, XLSX, TXT, MD
FastAPI Server: RESTful API with automatic documentation
Qdrant Vector Database: High-performance vector storage with Docker deployment
Semantic Search: Find documents by meaning, not just keywords
Stable Dependencies: Tested with PyTorch 1.13.1, transformers 4.21.3

Quick Start

1. Setup Required Directories

The system requires specific directory structure. Ensure you have:

# Create the main project directory structure
mkdir -p existing_files/{Policies,"RFIs and Other",SOPs,Statements,"Support Docs"}

2. Install Dependencies

# Create virtual environment
python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install stable dependencies
pip install -r requirements.txt

3. Start Qdrant Database

# Start Qdrant with Docker (includes health checks)
./start_qdrant.sh

This will:

Start Qdrant vector database on localhost:6333
Wait for the database to be ready
Provide access to Qdrant UI at http://localhost:6333/dashboard

4. Process Documents

python setup_qdrant.py

This will:

Process all files in existing_files/
Create embeddings using CPU/GPU (auto-detected)
Store vectors in Qdrant database
Show statistics about processed documents

5. Start API Server

python api_server_qdrant.py

Server runs at: http://localhost:8000 API docs at: http://localhost:8000/docs

API Usage

Simple Search

curl "http://localhost:8000/search/simple?query=GDPR+compliance&limit=3"

Advanced Search

curl -X POST "http://localhost:8000/search" \
  -H "Content-Type: application/json" \
  -d '{
    "query": "data security policies",
    "top_k": 5,
    "min_score": 0.7,
    "category_filter": "Policy"
  }'

Get Statistics

curl "http://localhost:8000/stats"

Health Check

curl "http://localhost:8000/health"

Using with Claude Code

Important: Directory Context Required

You MUST run Claude Code from this project directory for Claude to discover and use the API:

# Navigate to the project directory
cd /path/to/rfi_vector_search

# Start Claude Code from this directory
claude

# OR use the full path
claude /path/to/rfi_vector_search

How Claude Code Discovers the API

Claude Code doesn't automatically know about your API. It discovers it through:

CLAUDE.md: Contains system documentation and API instructions
README.md: Documents available endpoints and usage
Source Code: Can read api_server_qdrant.py to understand the API
Running Processes: Can detect the server running on port 8000

API Usage in Claude Sessions

Once the server is running and you're in the correct directory, Claude can search your documents:

# Search for compliance-related documents
curl "http://localhost:8000/search/simple?query=ISO+27001+compliance"

# Find specific policies
curl "http://localhost:8000/search/simple?query=data+retention+policy"

# Search SOPs
curl "http://localhost:8000/search/simple?query=incident+response+procedure"

Key Requirements for Claude Code Integration

✅ Run from project directory: Claude needs access to CLAUDE.md and documentation
✅ Server must be running: python api_server_qdrant.py in background
✅ Qdrant must be running: ./start_qdrant.sh executed first
✅ Documents processed: python setup_qdrant.py completed successfully

Note: Claude uses standard HTTP requests via curl - the same commands you would use manually. It's not a special integration, just well-documented local API usage.

Document Categories

The system automatically categorizes documents:

RFI: Files in "RFIs and Other/"
Policy: Files in "Policies/"
SOP: Files in "SOPs/"
Statement: Files in "Statements/"
Support Doc: Files in "Support Docs/"

Required Directory Structure

The system expects the following directory structure to be in place:

rfi_vector_search/
├── existing_files/                    # Source documents directory
│   ├── Policies/                      # Policy documents (POL-*.docx)
│   ├── RFIs and Other/               # RFI documents and other files
│   ├── SOPs/                         # Standard Operating Procedures (SOP-*.docx)
│   ├── Statements/                   # Compliance statements (STAT-*.docx)
│   └── Support Docs/                 # Additional support documentation
├── qdrant_data/                      # Qdrant vector database storage (Docker volume)
├── venv/                            # Python virtual environment
├── __pycache__/                     # Python bytecode cache
├── qdrant_processor.py              # Core document processing logic (Qdrant)
├── api_server_qdrant.py             # FastAPI web server (Qdrant)
├── setup_qdrant.py                  # Initial setup and document processing
├── start_qdrant.sh                  # Qdrant startup script
├── docker-compose.yml               # Qdrant Docker configuration
├── requirements.txt                # Python package dependencies
├── CLAUDE.md                       # Claude Code guidance
└── README.md                       # This documentation

Directory Requirements

existing_files/: This is the main source directory containing all documents to be processed. The system automatically categorizes documents based on their subdirectory:

Policies/: Contains policy documents (POL-*.docx files)
RFIs and Other/: Contains RFI documents, templates, and other miscellaneous files
SOPs/: Contains Standard Operating Procedures (SOP-*.docx files)
Statements/: Contains compliance and regulatory statements (STAT-*.docx files)
Support Docs/: Contains additional support documentation

qdrant_data/: Docker volume for Qdrant vector database storage:

Vector embeddings of processed documents
Qdrant database files and indexes
Search metadata and configuration

venv/: Python virtual environment (created during setup) pycache/: Python bytecode cache (auto-generated)

Troubleshooting

Common Issues

Import Errors: Ensure virtual environment is activated and dependencies installed
No Results: Check that documents were processed successfully with setup_qdrant.py
Server Won't Start: Check port 8000 isn't already in use
Qdrant Connection: Ensure Docker is running and Qdrant container is healthy
GPU Issues: System falls back to CPU automatically if CUDA unavailable

Dependency Issues

If you encounter version conflicts, use these stable versions:

pip install torch==1.13.1 transformers==4.21.3 sentence-transformers==2.2.2 qdrant-client

Performance Tips

GPU Usage: System auto-detects CUDA and optimizes batch sizes for available VRAM
Large Files: System automatically limits processing for very large PDFs/Excel files
Memory: Clear GPU cache between large processing jobs if needed

Adding New Documents

Add files to existing_files/ directory
Run: curl -X POST "http://localhost:8000/process"
New documents will be processed and added to the database

Testing with Postman

Import the provided Postman collection (RFI_Vector_Search.postman_collection.json) to test all API endpoints. The collection includes 11 comprehensive tests:

Core Functionality Tests

Health Check (/health): Verify system status and Qdrant connection
Root Health Check (/): Basic API availability test
Get Statistics (/stats): Database statistics and document counts

Search Tests

Simple Search - Compliance: Test basic search functionality
Simple Search - Data Security: Search for specific policy types
Advanced Search - High Scored Results: Test minimum score filtering
Advanced Search - Category Filter: Test category-specific searches
Advanced Search - SOPs: Search Standard Operating Procedures

Document Processing

Process Documents (/process): Test document ingestion endpoint

Edge Case Tests

Empty Query Handling: Test validation for empty search queries
Large Limit Test: Test behavior with large result limits

Features

Automated Testing: Each request includes test scripts to verify responses
Environment Variables: Uses {{base_url}} variable (default: http://localhost:8000)
Response Validation: Tests check response structure, data types, and business logic
Error Handling: Tests verify proper error responses for edge cases

Usage

Import the collection into Postman
Ensure the API server is running (python api_server_qdrant.py)
Run individual tests or the entire collection
View test results in the Postman Test Results tab

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.claude		.claude
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
README.md		README.md
RFI_Vector_Search.postman_collection.json		RFI_Vector_Search.postman_collection.json
api_server.py		api_server.py
api_server_qdrant.py		api_server_qdrant.py
docker-compose.yml		docker-compose.yml
install_cuda.py		install_cuda.py
qdrant_processor.py		qdrant_processor.py
requirements.txt		requirements.txt
setup.py		setup.py
setup_qdrant.py		setup_qdrant.py
start_qdrant.sh		start_qdrant.sh
start_server.sh		start_server.sh
test_gpu.py		test_gpu.py
test_system.py		test_system.py

DominicFinn/rfi_vector_search

Folders and files

Latest commit

History

Repository files navigation

RFI Vector Search System

Features

Quick Start

1. Setup Required Directories

2. Install Dependencies

3. Start Qdrant Database

4. Process Documents

5. Start API Server

API Usage

Simple Search

Advanced Search

Get Statistics

Health Check

Using with Claude Code

Important: Directory Context Required

How Claude Code Discovers the API

API Usage in Claude Sessions

Key Requirements for Claude Code Integration

Document Categories

Required Directory Structure

Directory Requirements

Troubleshooting

Common Issues

Dependency Issues

Performance Tips

Adding New Documents

Testing with Postman

Core Functionality Tests

Search Tests

Document Processing

Edge Case Tests

Features

Usage

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages