To convert a PDF into a Word document, start Word and perform the Open File process: browse to the location of the PDF file and select it as you would any other file. Alternatively, select a PDF and it will open in our PDF editor; then (optionally) click 'Convert to Word Document' in the upper right corner. Use our Optical Character Recognition (OCR) tool to quickly spot editable text within any scanned document or image file.

OCR is a field of research in pattern recognition, artificial intelligence and computer vision. Early versions needed to be trained with images of each character and worked on one font at a time. Advanced systems capable of a high degree of recognition accuracy for most fonts are now common, and they accept a variety of digital image file formats as input. OCR is widely used as a form of data entry from printed paper records (passport documents, invoices, bank statements, computerized receipts, business cards, mail, printouts of static data, or any suitable documentation). It is a common method of digitizing printed texts so that they can be electronically edited, searched, stored more compactly, displayed online, and used in machine processes such as cognitive computing, machine translation, (extracted) text-to-speech, key data and text mining.
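To make the point about early OCR concrete, here is a minimal sketch of per-font template matching, where each character must be "trained" by supplying one reference image. The 3x5 bitmaps, the `TEMPLATES` table and the function names are all hypothetical and chosen only for illustration; this is not how Word or any particular OCR product works.

```python
# Toy template-matching OCR. Each glyph is a 3x5 bitmap ('#' = ink, '.' = blank).
# The table below plays the role of the "training images" for a single font.
TEMPLATES = {
    "I": ("###",
          ".#.",
          ".#.",
          ".#.",
          "###"),
    "L": ("#..",
          "#..",
          "#..",
          "#..",
          "###"),
    "T": ("###",
          ".#.",
          ".#.",
          ".#.",
          ".#."),
}


def pixel_distance(a, b):
    """Count the pixels that differ between two equally sized bitmaps."""
    return sum(pa != pb
               for row_a, row_b in zip(a, b)
               for pa, pb in zip(row_a, row_b))


def recognize_glyph(bitmap):
    """Return the template character with the fewest differing pixels."""
    return min(TEMPLATES, key=lambda ch: pixel_distance(TEMPLATES[ch], bitmap))


def recognize_line(glyphs):
    """Recognize a sequence of already-segmented glyph bitmaps."""
    return "".join(recognize_glyph(g) for g in glyphs)
```

A glyph with a little scanning noise is still matched to its nearest template, but any character or font missing from the table cannot be recognized at all, which is the limitation that later, font-independent systems removed.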
If you want to edit the PDF file, go ahead and open it in Word. Word makes a copy of the PDF, converting it to a Word document and attempting to match the original PDF.

Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene photo (for example, the text on signs and billboards in a landscape photo) or subtitle text superimposed on an image (for example, from a television broadcast).