How to Convert Scanned PDFs with PDF to Word ++ (PDF to Word OCR)
PDF to Word ++ is an easy-to-use PDF converter with OCR capability. It converts both electronic and scanned PDF documents into editable, well-formatted Word documents (.docx) and plain text (.txt). It preserves the original content, layout and formatting after conversion, so you do not have to retype the document manually or do tedious copy-and-paste work.
Open Finder > Applications and click the 'PDF to Word ++' icon. When the interface appears, click the 'Add Files' button. A Finder window will slide down, where you can select the PDF files you wish to convert (there is no limit on the number of files you can select).
Alternatively, you can drag files onto the app to import them.
Word (.docx) and Text (.txt) are available as output formats in the drop-down list. To apply this setting to all imported PDF files, check 'Apply to all imported files'.
(* The OCR Option is for converting scanned PDF files. If your PDF contains editable text content, you do not need to select it.)
You can convert all pages or only selected pages. To convert selected pages only, type the page numbers or page ranges in the blank field, e.g. 1,3-5,10. Separate entries with commas and do not use spaces.
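The page-selection syntax above can be illustrated with a short Python sketch. This is not the application's own code; the function name parse_page_selection and the page_count parameter are hypothetical and serve only to show how an entry such as 1,3-5,10 expands into individual pages.

```python
def parse_page_selection(spec: str, page_count: int) -> list[int]:
    """Expand a selection such as '1,3-5,10' into sorted 1-based page numbers.

    Illustrative only: the name and the validation rules are assumptions,
    not the behavior of PDF to Word ++ itself.
    """
    if any(ch.isspace() for ch in spec):
        raise ValueError("spaces are not allowed in a page selection")

    pages = set()
    for part in spec.split(","):
        # "3-5" splits into ("3", "-", "5"); a single page "10" gives ("10", "", "")
        start_text, dash, end_text = part.partition("-")
        start = int(start_text)  # raises ValueError for empty or non-numeric input
        end = int(end_text) if dash else start
        if start < 1 or end < start or end > page_count:
            raise ValueError(f"invalid page or range: {part!r}")
        pages.update(range(start, end + 1))
    return sorted(pages)


print(parse_page_selection("1,3-5,10", page_count=12))  # [1, 3, 4, 5, 10]
```

In this sketch, an entry containing a space, such as "1, 3", is rejected, which mirrors the "no spaces" rule stated above.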
Click 'Browse' to select the output folder where the converted documents will be stored. After conversion, you can click 'Open' to open the output folder directly.
If you leave this field blank, the application will prompt you to choose an output path when you click the 'Convert' button.
Once you have finished the settings above, click the 'Convert' button to start the conversion. A progress bar will appear.
After conversion, you can click the link to open the converted Word or Text document directly.
When you convert a scanned PDF without performing OCR, the entire content is converted into an image rather than editable text. In the resulting Word document, you cannot modify anything; you can only move the image.
You will see a notification message when the conversion finishes.
To convert a scanned PDF into editable content, please follow these steps.
Click the 'OCR Option' button and check the 'Perform OCR' option.
You need to select the appropriate document language before OCR conversion. This is an extremely important step for getting accurate text recognition results.
The application supports 10 languages, including English, French, German, Italian, Spanish, Portuguese, Polish, Swedish, Russian and Dutch.
If you need to apply the OCR setting to all imported files, check 'Apply to all'.
Incorrect orientation of the document will result in poor conversion quality.
Move your mouse cursor to the top left of the built-in PDF reader, and the rotate buttons will appear. The rotate operation affects only the current page.
Extracting text is the main purpose of performing OCR. If the scanned PDF contains image elements, you need to select them before conversion to better preserve formatting and improve accuracy.
(1) To select an image area, move your mouse cursor over the built-in reader, hold down the left mouse button and drag to select the area, then release the mouse button.
(2) To move or adjust the area, click on it and drag the area border to the desired location.
(3) To remove a selected area, select it and press the 'Delete' key on your keyboard, or move your mouse cursor to the top left of the built-in PDF reader, where the 'remove' buttons will appear. You can remove a single selected area or all the selected areas in the document.
Selected areas are preserved as images in the converted Word document, and the app does not perform OCR on them. This helps keep the original layout. If you do not select an image area, any text in the image will still be recognized by OCR, but the image itself will be missing from the output document.
OCR conversion takes longer than normal PDF conversion. Please be patient.
When the conversion finishes, you will get an editable Word document instead of an image-only one.
Note: OCR is not an easy task. Both the quality of the source PDF and the OCR options you choose affect the quality and accuracy of the output file.
Remember to check the spelling, content & numbers after conversion.