gscan2pdf logo

gscan2pdf

Enables scanning, cleaning, and optical character recognition on images and documents, generating searchable PDF and DjVu files.

Made by Jeffrey Ratcliffe

  • pdf

  • djvu-converter

  • djvu

  • text-recognition

What is gscan2pdf?

The application offers versatile scanning and image processing capabilities, enabling users to capture, clean, and perform optical character recognition (OCR) on scanned documents, existing PDFs, DjVus, and other file types. With the integration of various OCR engines, including Tesseract, Ocropus, Cuneiform, and GOCR, the software can generate PDF and DjVu files with embedded OCR text, providing users with a comprehensive solution for digitizing and managing their physical documents

Highlights

  • Scan, clean, and perform OCR on scanned documents, PDFs, DjVus, and other file types
  • Support for multiple OCR engines (Tesseract, Ocropus, Cuneiform, GOCR)
  • Create PDF and DjVu files with embedded OCR text
  • Ability to process a variety of file formats

Platforms

  • Linux

Languages

  • English

Features

    • PDF OCR

    • OCR

    • Scan to PDF

    • Split and merge PDF files

    • Screenshot OCR

    • Export to PDF