Introducing PDF to Word ++ (with OCR ability)
PDF to Word ++ is an easy-to-use PDF converter with OCR ability that converts both electronic and scanned PDF documents into editable, well-formatted Word documents (.docx) and plain text (.txt). It preserves the original content, layout and formatting after conversion, so you no longer need to retype documents or copy and paste their content by hand.
Working with PDF to Word ++
1. Launch and add files:
Open Finder > Applications and click the ‘PDF to Word ++’ icon. When the interface appears, click the ‘Add Files’ button. A Finder window will slide down, where you can select the PDF files you wish to convert (there is no limit on the number of files).
Alternatively, you can drag files onto the app to import them.
2. Select Output Options
* Select output formats
Word (.docx) and Text (.txt) are available in the drop-down list. If you want to apply this setting to all imported PDF files, please check ‘Apply to all imported files’.
(* The OCR Option is for converting scanned PDF files. If your PDF already contains editable text, you don’t need to select it.)
* Select pages to convert
You can convert all pages or only selected pages. To convert selected pages only, type the page numbers or page ranges in the blank field, separated by commas and with no spaces, e.g. 1,3-5,10.
* Select output folder
Click ‘Browse’ to select the desired output folder to store the converted documents. After conversion, you can click ‘Open’ to open the output folder directly.
If you leave the output folder blank, the application will prompt you to choose an output path when you click the ‘Convert’ button.
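The page selection syntax described above (comma-separated page numbers and ranges with no spaces, e.g. 1,3-5,10) can be illustrated with a short Python sketch. This is not the application's own code: the function name parse_page_selection and the decision to reject spaces and reversed ranges are assumptions made purely for illustration.

```python
def parse_page_selection(selection: str) -> list[int]:
    """Expand a selection such as '1,3-5,10' into a sorted list of page numbers.

    Hypothetical helper for illustration only; not part of PDF to Word ++.
    """
    if " " in selection:
        raise ValueError("spaces are not allowed in a page selection")

    pages = set()
    for part in selection.split(","):
        if "-" in part:
            # A range such as '3-5' covers both end pages.
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
            if start > end:
                raise ValueError(f"reversed range: {part}")
            pages.update(range(start, end + 1))
        else:
            # A single page number such as '10'.
            pages.add(int(part))

    if any(page < 1 for page in pages):
        raise ValueError("page numbers start at 1")
    return sorted(pages)


print(parse_page_selection("1,3-5,10"))  # [1, 3, 4, 5, 10]
```

In this sketch, an entry such as "1, 3" (with a space) raises a ValueError, which mirrors the "no spaces" rule of the input field.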
3. Start conversion
Once you have finished the settings above, click the ‘Convert’ button to start the conversion. A progress bar will appear.
After conversion, you can click the link to open the converted Word or Text document directly.
If you convert a scanned PDF without performing OCR, the whole content will be converted into an image instead of editable text. You will not be able to modify anything in the Word document except the position of the image.
You’ll also get a notification message when the conversion has finished.
To convert a scanned PDF into editable content, please follow these steps.
1. Enable OCR in ‘OCR Option’
Click the ‘OCR Option’ button and check the ‘Perform OCR’ option.
2. Select the document language
You need to select the appropriate document language before OCR conversion. This step is essential for getting accurate text recognition results.
The application supports 10 languages, including English, French, German, Italian, Spanish, Portuguese, Polish, Swedish, Russian and Dutch.
If you need to apply the OCR setting to all imported files, check ‘Apply to all’.
3. Rotate pages to the correct orientation
Incorrect orientation of the document will result in poor conversion quality.
Move your mouse cursor to the top left of the built-in PDF reader and the rotate buttons will appear. Rotation only affects the current page.
4. Select image areas
Extracting text is the main purpose of performing OCR. If the scanned PDF contains image elements, you need to select them before the conversion for better formatting preservation and accuracy.
(1) To select an image area, move your mouse cursor to the built-in reader, hold the left mouse button and drag to select an area, then release the button.
(2) To move or adjust the area, click on it and drag the area border to the desired location.
(3) To remove a selected area, select it and press the ‘Delete’ key on your keyboard, or move your mouse cursor to the top left of the built-in PDF reader, where the ‘remove’ buttons will appear. You can remove a single selected area or all the selected areas in the document.
The selected areas will be preserved as images in the converted Word document, and the app will not perform OCR on them. This helps keep the original layout. If you don’t select an image area, any text on the image will still be recognized by OCR, but the image itself will be missing from the output document.
5. Click ‘Convert’ to start OCR conversion.
OCR conversion takes longer than normal PDF conversion. Please be patient.
When the conversion has finished, you will get an editable Word document instead of an image-only one.
Note: OCR is not an easy task. Both the quality of the source PDF and the OCR options affect the quality and accuracy of the output file.
Remember to check the spelling, content & numbers after conversion.
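As a rough aid for that check, the following Python sketch flags words in OCR output that mix letters and digits, a pattern often caused by confusions such as the letter O and the digit 0. It is only a heuristic and not a feature of the application: the function name flag_suspicious_tokens and the sample text are assumptions for illustration, and legitimate tokens such as "A4" would be flagged too.

```python
import re

# A token is a run of letters and digits (underscores excluded).
TOKEN = re.compile(r"[^\W_]+")


def flag_suspicious_tokens(text: str) -> list[str]:
    """Return tokens that contain both letters and digits.

    Hypothetical helper for illustration only; it does not correct anything,
    it just lists candidates for manual review.
    """
    suspicious = []
    for token in TOKEN.findall(text):
        has_letter = any(ch.isalpha() for ch in token)
        has_digit = any(ch.isdigit() for ch in token)
        if has_letter and has_digit:
            suspicious.append(token)
    return suspicious


# Made-up sample text with the letter O in place of the digit 0.
sample = "Invoice total: 1O5 euros, due on 12 May 2O24."
print(flag_suspicious_tokens(sample))  # ['1O5', '2O24']
```

The same function could be applied to the contents of a converted .txt file; every flagged token still needs to be compared against the source PDF by eye.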