ABBYY FineReader Engine 9.0 for Linux > Key Features

The Engine’s API delivers intelligent structural recognition of document layout, including autodection of text, images, barcodes and pictures, text orientation detection. It also supports special document analysis  designed to be used in classification and capture applications, including invoice processing, tables analysis and vertical text detection.

Document Analysis

The document analysis set of functions of the FineReader Engine API delivers automatic document conversion with full-page layout retention, zoning OCR with manually located blocks and more. It includes:

Special document analysis functions of the ABBYY OCR SDK include:

Document Analysis for Invoices

This is a pre-processing engine for converting semi-structured documents, such as invoices, payment drafts, bills, waybills, business cards, agreements, health claim forms, resumes, etc. It has been designed to accurately locate all the text on these documents, including characters and numbers — even if this information is located within stamps, pictures, logos or small-text areas.

Unlike the standard full-page document analysis, this one assumes that all printed information on documents is text. It also ensures that important text information is not identified as graphic elements and words or numerical values are not separated into multiple characters. As a result, maximum information about the text, including its coordinates, is available for analysis, field-by-field processing and parsing at subsequent processing stages by other systems.

Document Analysis for Invoices is used in FlexiLayout Studio as a first step of semi-structured document analysis, helping to extract data from unstructured forms and documents with similar data but different layouts.

Document Analysis for Full-Text Indexing

Automatically detects and recognizes all text on documents including text embedded in pictures, charts, and diagrams. Developers may choose to use this mode of document analysis to extract exhaustive full-text information on documents needed for document index building (as in DMS, CMS, Archiving systems).

Field-Level/Zonal Recognition

ABBYY FineReader Engine 9.0 delivers complete field-level/zonal recognition capabilities to support key business processes such as forms processing, keyword classification, and keyword indexing. Powerful image processing functions increase its ability to intelligently detect small zone areas of any quality, with any type of graphic nuances which may affect the recognition accuracy (i.e. underlined text, after-scanning garbage, spaces in the text, etc.)
Key functionality for field-level or zonal recognition includes multilingual OCR and ICR, OMR, barcode recognition and a range of specific functions, such as:

  • Data extraction from fields with various borders and frames, including combo-box, underlined fields, boxes, and even fields where the data does not fit within the field border
  • Definition of field content by setting alphabets, dictionaries, regular expressions, types of segmentations, handwriting styles, etc
  • Detection of in-field spacing, accurately recognizing fields where the spaces are allowed. FineReader Engine 9.0 also allows use of dictionaries which contain word combinations with spaces 
  • Intelligent processing of blocks with intersecting parts and lines, provides recognition of text (words and symbols) located entirely within the block borders, saving time spent on non-relevant text block recognition 
  • Text block despeckle, with the ability to specify the size of white or black "garbage"

Field-level/zonal recognition is supported by the Engine’s special tools for developers such as Voting API and "On-the-Fly" Recognition Tuning. For details, please see Advanced Development Tools.


Other ABBYY FineReader Engine 9.0 for Linux Features:

Image Import
Image Processing
Layout Reconstruction
Language Support
Barcode Types
PDF Conversion
Advanced Development Tools
Output Options



Please enter your name and e-mail in the form below:
First Name:
Last Name:
E-mail:

*Your email address will be used to send information about the product purchase, news and updates only. Your email address will not be sold, rented or shared with other parties.