What is contentCrawler?
contentCrawler is an integrated analysis, processing, and reporting framework that assesses documents in a document management system (DMS) for bulk processing. It evaluates image-based documents in content repositories and converts them to text-searchable PDFs using optical character recognition (OCR). In addition, the Compression module can apply compression and downsampling to all PDFs, reducing their file size. This automated end-to-end process can run 24/7 without staff intervention, sending periodic notifications of processing statistics and error reports to the IT administrator. Users no longer have to manage OCR or compression as a separate process or workflow, because contentCrawler handles these tasks within the DMS.
Highlights
- Intelligent assessment of documents in a DMS for bulk processing
- Optical character recognition (OCR) to convert image-based documents to text-searchable PDFs
- Compression and downsampling of PDFs to reduce file size
- Automated 24/7 processing with periodic status updates and error reporting
- Eliminates the need for staff to manage OCR or compression as a separate process
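
The assessment step described above can be pictured as a simple triage over document metadata. The following is only an illustrative sketch, not contentCrawler's actual logic or API: the `Document` record, the `IMAGE_EXTENSIONS` set, and the `plan_actions` and `summarize` functions are hypothetical names, and the rule that OCR output is also compressed is an assumption.

```python
from dataclasses import dataclass
from typing import Dict, List

# Hypothetical metadata record; a real DMS would expose far more fields.
@dataclass
class Document:
    name: str
    extension: str
    has_text_layer: bool

# Assumed set of image formats that would need OCR.
IMAGE_EXTENSIONS = {"tif", "tiff", "jpg", "jpeg", "png", "bmp"}

def plan_actions(doc: Document) -> List[str]:
    """Decide which processing steps a document needs (illustrative rules)."""
    ext = doc.extension.lower().lstrip(".")
    is_pdf = ext == "pdf"
    actions: List[str] = []
    # Image files and PDFs without a text layer are treated as image-based.
    if ext in IMAGE_EXTENSIONS or (is_pdf and not doc.has_text_layer):
        actions.append("ocr")
    # Assumption: every PDF, including one produced by OCR, is compressed.
    if is_pdf or "ocr" in actions:
        actions.append("compress")
    return actions

def summarize(docs: List[Document]) -> Dict[str, int]:
    """Count planned actions, similar in spirit to a processing statistics report."""
    counts = {"ocr": 0, "compress": 0, "skipped": 0}
    for doc in docs:
        actions = plan_actions(doc)
        if not actions:
            counts["skipped"] += 1
        for action in actions:
            counts[action] += 1
    return counts

docs = [
    Document("scan.tif", "tif", False),
    Document("report.pdf", "pdf", True),
    Document("fax.pdf", "pdf", False),
    Document("memo.docx", "docx", True),
]
print(summarize(docs))
```

In this sketch the scanned TIFF and the image-only PDF are queued for OCR, all three resulting PDFs are queued for compression, and the Word document is skipped.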
Features
- Document Management
- OCR
- Image Processing
- File Compression
- Image Optimizer
- DMS
- Image to Text
- PDF Compression