About the OCR Plug-in

Text on image pages cannot be selected or copied directly. Use the OCR plug-in to convert it into text data that you can search and copy.
With the OCR plug-in, you can perform OCR on the following files:
  • Image pages from DocuWorks files
  • PDF documents
You can select and process multiple files at once. DocuWorks files and PDF documents can be selected together.
This section describes the built-in OCR program.
If the OCR engine was not installed during installation, the built-in OCR program cannot be used with the OCR plug-in.
OCR enables you to perform the following operations:
  • You can rotate documents to a readable orientation before performing OCR. You can also rotate documents without performing OCR.
  • You can apply noise reduction and deskewing to make characters easier to recognize during OCR. Note that the results of noise reduction and deskewing are not reflected in the processed document.
  • You can display the progress of OCR processing.
  • You can specify the recognition area.
  • When processing a color or grayscale image, you can give priority to either the recognition rate or the speed. Two-color (monochrome) image pages are always processed with priority on speed. Priority on the recognition rate is effective for outline characters, characters in pale colors, and characters laid out over background images. In that case, however, OCR processing may take longer than with priority on speed.
  • You can save the OCR results as text, RTF, Excel, or CSV files.
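The priority rule for color, grayscale, and monochrome pages can be summarized in a minimal sketch. This is only an illustration of the rule as described above, not the plug-in's actual implementation; the names `ImageType`, `Priority`, and `effective_priority` are hypothetical.

```python
from enum import Enum


class ImageType(Enum):
    """Kinds of image page mentioned in this section (hypothetical names)."""
    COLOR = "color"
    GRAYSCALE = "grayscale"
    MONOCHROME = "monochrome"  # two-color image pages


class Priority(Enum):
    """The two OCR priorities a user can choose between (hypothetical names)."""
    RECOGNITION = "recognition"
    SPEED = "speed"


def effective_priority(image_type: ImageType, requested: Priority) -> Priority:
    """Return the priority that would apply to a page.

    Monochrome pages are always processed with priority on speed;
    color and grayscale pages follow the user's choice.
    """
    if image_type is ImageType.MONOCHROME:
        return Priority.SPEED
    return requested
```

For example, requesting priority on recognition for a monochrome page still yields priority on speed, while the same request for a grayscale page is honored.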
Note
You can also use Viewer to perform OCR processing and to edit the OCR results.