Image Processing Defaults

This Help topic refers to the following editions:

☑ Enterprise ☑ Professional ☑ Personal ☑ Small Business

 

From the DocuXplorer Desktop Ribbon, select Home | Options | Image Processing Defaults to set the default settings for the Scanning Preview Window display.

Image Processing defaults are workstation specific. Any user can change the options on this screen; no administrative access is required.

Quick Scan Options

Show Twain Interface - When checked, this option forces DocuXplorer to display the scanner's TWAIN interface when scanning new pages.

Display Scanned Document Dialog after scanning completes - Check this box to display the Scanned Document dialog box only after scanning completes, so you can review the scan quality and add additional pages.

Convert scanned documents to PDF - When this check box is selected, all documents scanned with the Quick Scan option are automatically converted to PDF format. DocuXplorer does not automatically convert scanned images to PDF when using the Scanner/Camera option. With the Scanner/Camera option, Save and Close the image, then right-click it and select <Convert Document to: PDF>.

 

Optical Character Recognition

OCR Zone data file location - This file holds the OCR templates that are created for automated data entry processing. If the file is placed on a network drive, make sure users have read and write access to it.
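When the data file sits on a network drive, it can help to confirm from the workstation that the file really opens for both reading and writing. The following is a minimal sketch in Python, not part of DocuXplorer; the function name and the example path are hypothetical and used only for illustration.

```python
def can_read_write(path):
    """Return True if the existing file at `path` opens for reading and writing."""
    try:
        # "r+b" opens an existing file for read/write without truncating it,
        # so the check does not alter the file's contents.
        with open(path, "r+b"):
            return True
    except OSError:
        # Covers a missing file, a denied permission, or an unreachable share.
        return False


if __name__ == "__main__":
    # Hypothetical network location; substitute the path configured on this screen.
    print(can_read_write(r"\\server\share\OCRZones.dat"))
```

Run the check while logged in as the user who will be scanning, because access rights on a network share are evaluated per user.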

OCR Engine Type - The OCR engine can work at three speeds: Fast, Medium, and Slow. Fast is the default and produces acceptable results for most document types. Medium and Slow, as the names imply, work more slowly but with a higher degree of accuracy.

OCR Segmentation Mode - The OCR engine can work with varying page layouts. By default, the OCR engine expects a full page of text when it segments an image. If you only need to OCR a small region, try a different segmentation mode. Note that adding a white border to text that is too tightly cropped may also help.

OCR Engine - In general, use LSTM, which provides significant speed and accuracy benefits. See:
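The white-border tip can be illustrated outside DocuXplorer. The sketch below pads a grayscale image, represented here as a plain list of pixel rows, with white pixels on every side. The representation, the function name, and the default border width of 10 pixels are assumptions chosen for illustration; this is not how DocuXplorer itself handles images.

```python
def add_white_border(pixels, border=10, white=255):
    """Return a copy of a grayscale image (list of rows) padded with white pixels."""
    if border < 0:
        raise ValueError("border must be non-negative")
    width = len(pixels[0]) if pixels else 0
    new_width = width + 2 * border

    # Fresh rows of white for the top and bottom margins.
    top = [[white] * new_width for _ in range(border)]
    bottom = [[white] * new_width for _ in range(border)]

    # Each original row gains a white margin on the left and on the right.
    middle = [[white] * border + list(row) + [white] * border for row in pixels]
    return top + middle + bottom
```

For example, padding a single black pixel with a one-pixel border yields a 3 x 3 image whose center pixel is black and whose other pixels are white.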

https://github.com/tesseract-ocr/tesseract/wiki/4.0-Accuracy-and-Performance

 

OCR Language - This option allows a user to change the default language that is used to perform OCR text extraction on documents. If this option is left blank (the default), then the selected interface language is used for the OCR process.
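For readers who want to experiment with these settings outside DocuXplorer, the sketch below shows how the segmentation mode, engine, and language options map onto the Tesseract 4 command line. It assumes the OCR engine is Tesseract, as the link above suggests; how DocuXplorer invokes the engine internally is not documented here, and the function name and file names are hypothetical. In Tesseract's own numbering, --oem 1 selects the LSTM engine, --psm 3 is fully automatic page segmentation, and --psm 7 treats the image as a single line of text.

```python
def build_tesseract_args(image_path, output_base, psm=3, oem=1, lang=None):
    """Build a Tesseract 4 command line as a list of arguments."""
    args = [
        "tesseract", image_path, output_base,
        "--oem", str(oem),  # 1 = LSTM engine only
        "--psm", str(psm),  # 3 = automatic page segmentation, 7 = single text line
    ]
    if lang:
        # Tesseract language code, for example "eng" or "deu".
        args += ["-l", lang]
    return args


if __name__ == "__main__":
    # Hypothetical file names; prints the command without running it.
    print(" ".join(build_tesseract_args("region.png", "region_out", psm=7, lang="eng")))
```

Note one difference in this sketch: when no language is passed, Tesseract falls back to its own default language, whereas DocuXplorer uses the selected interface language.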

 

Image Display Settings

Rubber Stamp data file location - This file holds the rubber stamp information defined when editing TIFF documents. If the file is placed on a network drive, make sure users have read and write access to it.