dr-doc-search

Converse with book - Built with GPT-3

open-sourceagent-frameworks
Visit WebsiteView on GitHub
597
Stars
+50
Stars/month
0
Releases (6m)

Overview

dr-doc-search is a Python tool that enables conversational interaction with PDF documents using AI. Built on GPT-3 and supporting HuggingFace models as an alternative, it creates searchable embeddings from PDF content and allows users to ask natural language questions about the document. The tool handles both regular text PDFs and scanned documents through OCR integration with Tesseract. It operates in two phases: first creating an index with embeddings from the PDF content, then enabling interactive querying through a conversational interface. The tool extracts text, generates embeddings for semantic search, and provides AI-powered responses based on the document content. With 597 GitHub stars, it represents a practical approach to document intelligence, making large PDFs more accessible through natural language interaction rather than manual searching.

Pros

  • + Supports multiple AI backends including OpenAI GPT-3 and HuggingFace models for flexibility
  • + Handles both regular text PDFs and scanned documents through integrated OCR capabilities
  • + Simple CLI interface with clear two-step workflow for indexing and querying documents

Cons

  • - Requires external dependencies (Tesseract OCR and ImageMagick) which can complicate setup
  • - Limited to PDF format only, doesn't support other document types
  • - Two-step process requires separate training phase before use, adding workflow complexity

Use Cases

Getting Started

Install via pip install dr-doc-search and set up Tesseract OCR and ImageMagick dependencies. Set your OpenAI API key as an environment variable (or use HuggingFace alternative). Run dr-doc-search --train -i your-document.pdf to create the searchable index, then use the CLI to start asking questions about your document content.