[OCR Setting] dialog box

Use this dialog box to specify the settings for OCR.
This dialog box appears when you click [Setting] in the [OCR] dialog box, provided that the OCR plug-in has been added to the [Current Plug-in Menu] in the [Plug-in Setting] dialog box.

[Close dialog box when operation finishes successfully]

Closes the [OCR] dialog box when the OCR process finishes successfully.
By default, this box is cleared.

[Perform preprocessing only (Do not perform OCR process)]

Specifies whether to perform preprocessing only, without OCR processing.
By default, this box is cleared.

[Preprocessing]

[Automatically Rotate Page]

Automatically rotates scanned documents to a readable orientation.
By default, this box is cleared.
Note
  • Preprocessing does not delete text produced by earlier OCR processing.
  • You cannot rotate a document that has any of the following annotations attached: Link, OLE, Date Stamp, Received Stamp, Title, Date/Time, Cloud Callout, or a grouped annotation.

[Option for OCR]

Configure the settings related to OCR. These processes are applied only to the images passed to OCR; they are not applied to the source files.
Note
If you select [Perform OCR processing in color] for [OCR processing for color image], [Reduce noise of image before OCR] and [Level] cannot be set.
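The dependency described in this note can be sketched as a small rule. This is only an illustration: the class and function names below are hypothetical and are not part of DocuWorks, and the defaults simply mirror the ones stated on this page.

```python
from dataclasses import dataclass

# Option labels as they appear on this page.
COLOR = "Perform OCR processing in color"
BW_SPEED = "Convert to B&W and prioritize speed"


@dataclass
class OcrOptions:
    """Hypothetical container for the [Option for OCR] settings."""
    color_mode: str = BW_SPEED   # documented default
    reduce_noise: bool = True    # documented default: selected
    noise_level: str = "Normal"  # documented default


def noise_controls_enabled(options: OcrOptions) -> bool:
    """Return True when [Reduce noise of image before OCR] and [Level] can be set."""
    # The two noise settings cannot be set when OCR runs in color.
    return options.color_mode != COLOR
```

With the default options the function returns True; after switching `color_mode` to `COLOR` it returns False.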

[Reduce noise of image before OCR]

Reduces noise in images before OCR. For color images, noise is reduced after the image has been binarized to monochrome.
By default, this box is selected.
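The order of operations (binarize first, then reduce noise) can be illustrated with a minimal sketch. The fixed threshold and the isolated-pixel filter below are assumptions chosen for illustration; the algorithm actually used by the OCR engine is not described on this page.

```python
def binarize(gray, threshold=128):
    """Map grayscale values (0-255) to 1 (black) or 0 (white)."""
    return [[1 if px < threshold else 0 for px in row] for row in gray]


def remove_isolated_pixels(bw):
    """Clear black pixels that have no black neighbour (8-connectivity)."""
    if not bw:
        return []
    height, width = len(bw), len(bw[0])
    out = [row[:] for row in bw]
    for y in range(height):
        for x in range(width):
            if bw[y][x] != 1:
                continue
            # Count black pixels in the surrounding 3x3 window, excluding the centre.
            neighbours = sum(
                bw[ny][nx]
                for ny in range(max(0, y - 1), min(height, y + 2))
                for nx in range(max(0, x - 1), min(width, x + 2))
                if (ny, nx) != (y, x)
            )
            if neighbours == 0:
                out[y][x] = 0
    return out


def preprocess_for_ocr(gray):
    """Binarize to monochrome, then reduce noise on the binarized image."""
    return remove_isolated_pixels(binarize(gray))
```

In this sketch a single dark speck surrounded by white is removed, while adjacent dark pixels, such as parts of a character stroke, are kept.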

[Level]

Specify the level of noise reduction. The default is [Normal].

[OCR processing for color image]

Specify whether to prioritize recognition or speed when OCR is performed on color and grayscale images.
[Convert to B&W and Prioritize Recognition] is effective for outline characters, characters in pale colors, and characters laid out on a background image. However, OCR processing may take longer than with [Convert to B&W and prioritize speed].
If you select [Perform OCR processing in color], the image to be processed is scanned in color mode, and the scanned image is output in color when the output format is set to [RTF (*.rtf)], [Excel (*.xlsx)], or [Word (*.docx)].
The default is [Convert to B&W and prioritize speed].

[OCR Detailed Settings]

Configure the detailed settings for OCR.
If the DocuWorks built-in OCR is used, the [OCR Advanced Settings] dialog box appears.

[Page to process]

Specify the pages on which to perform OCR.
The default is [All pages].

[PDF Processing]

Select how OCR is performed on PDF documents. OCR results are written to a newly created PDF document, not to the source document.

[Convert all pages to images and perform OCR]

Generates a PDF file in which every page of the PDF document has been converted to an image, and performs OCR processing on that file. The generated PDF file is given a name consisting of the original file name and a sub number.
If the PDF document to be processed contains results from earlier OCR processing, those results are lost.
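The naming of the generated file (original file name plus a sub number) can be sketched as follows. The exact format is not specified on this page, so the hyphen separator, the starting number 1, and the function name are assumptions made only for illustration.

```python
from pathlib import Path


def numbered_output_path(source: Path, existing: set) -> Path:
    """Return '<stem>-<n><suffix>' using the smallest n not already taken.

    The '-<n>' format is an assumed convention, not the documented one.
    """
    n = 1
    while True:
        candidate = source.with_name(f"{source.stem}-{n}{source.suffix}")
        if candidate.name not in existing:
            return candidate
        n += 1
```

For example, under these assumptions `numbered_output_path(Path("report.pdf"), {"report-1.pdf"})` yields a path named `report-2.pdf`, leaving the source file untouched.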

[Perform OCR only on the image part in the page]

Extracts images from the PDF document and performs OCR on those images. Text portions are not OCR processed.
If the PDF document to be processed contains results from earlier OCR processing, the new OCR processing results are added.