The PDF format was originally intended to display the exact same content and layout regardless of the operating system, device, or software application it is viewed on. Nowadays, however, it has become necessary to search through PDF documents, extract information, or convert complete documents into editable formats. This is not always easy, especially for PDFs created by scanning, known as “scanned” or “image-only” PDFs. Such files contain only a picture of each page and no text layer, so PDF conversion tools must incorporate OCR technology to enable search, extraction, and repurposing of information.
Optical Character Recognition (OCR), or text recognition, unlocks the information “trapped” in a scanned or photographed image of a document. OCR software such as ABBYY FineReader PDF “reads” the content of a document (text and structure) by interpreting character images and assigning each one an electronic equivalent. This makes it possible to transfer the content and layout of the document into searchable and editable formats.
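To make the idea of "assigning character images an electronic equivalent" concrete, here is a deliberately simplified Python sketch. It assumes tiny 3x5 black-and-white glyphs and a hypothetical three-letter template set (`TEMPLATES`), and it picks the template that differs from the input in the fewest pixels. This is only an illustration of the principle; it is not how ABBYY FineReader PDF or any production OCR engine works, since real engines also handle layout analysis, varying fonts, and image noise.

```python
# Toy illustration only: hypothetical 3x5 bitmaps, '#' = ink, '.' = background.
TEMPLATES = {
    "O": ("###", "#.#", "#.#", "#.#", "###"),
    "C": ("###", "#..", "#..", "#..", "###"),
    "R": ("##.", "#.#", "##.", "#.#", "#.#"),
}


def pixel_distance(a, b):
    """Count the pixels at which two equally sized bitmaps differ."""
    return sum(pa != pb for row_a, row_b in zip(a, b) for pa, pb in zip(row_a, row_b))


def recognize_glyph(glyph):
    """Return the character whose template is closest to the glyph image."""
    return min(TEMPLATES, key=lambda ch: pixel_distance(glyph, TEMPLATES[ch]))


def recognize_line(glyphs):
    """Turn a sequence of glyph images into a text string."""
    return "".join(recognize_glyph(g) for g in glyphs)


if __name__ == "__main__":
    # A "scanned" C with one pixel lost to noise is still read as "C".
    noisy_c = ("###", "#..", "...", "#..", "###")
    print(recognize_line([TEMPLATES["O"], noisy_c, TEMPLATES["R"]]))  # prints "OCR"
```

Once each glyph has been mapped to a character, the result is ordinary text that can be searched, copied, or edited, which is the step an image-only PDF is missing.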