Click2Scan Selects ABBYY UK as Strategic Partner for Data Capture ABBYY Europe and Click2Scan Ltd of the UK announce the successful integration of ABBYY FlexiCapture data ... more >> |
ABBYY FineReader Engine 9.0 for Linux introduces Adaptive Document Recognition Technology (ADRT), a core set of document synthesis algorithms which reproduces general logical structure of document. It automatically builds a logical model of the document structure and identifies its elements. ADRT defines
The document analysis set of functions of the FineReader Engine API delivers automatic document conversion with full-page layout retention, zoning OCR with manually located blocks and more. It includes:
Special document analysis functions of the ABBYY OCR SDK include:
This is a pre-processing engine for converting semi-structured documents, such as invoices, payment drafts, bills, waybills, business cards, agreements, health claim forms, resumes, etc. It has been designed to accurately locate all the text on these documents, including characters and numbers — even if this information is located within stamps, pictures, logos or small-text areas.
Unlike the standard full-page document analysis, this one assumes that all printed information on documents is text. It also ensures that important text information is not identified as graphic elements and words or numerical values are not separated into multiple characters. As a result, maximum information about the text, including its coordinates, is available for analysis, field-by-field processing and parsing at subsequent processing stages by other systems.
Document Analysis for Invoices is used in FlexiLayout Studio as a first step of semi-structured document analysis, helping to extract data from unstructured forms and documents with similar data but different layouts.
Automatically detects and recognizes all text on documents including text embedded in pictures, charts, and diagrams. Developers may choose to use this mode of document analysis to extract exhaustive full-text information on documents needed for document index building (as in DMS, CMS, Archiving systems).
ABBYY FineReader Engine 9.0 delivers complete field-level/zonal recognition capabilities to support key business processes such as forms processing, keyword classification, and keyword indexing. Powerful image processing functions increase its ability to intelligently detect small zone areas of any quality, with any type of graphic nuances which may affect the recognition accuracy (i.e. underlined text, after-scanning garbage, spaces in the text, etc.)
Key functionality for field-level or zonal recognition includes multilingual OCR and ICR, OMR, barcode recognition and a range of specific functions, such as:
Field-level/zonal recognition is supported by the Engine’s special tools for developers such as Voting API and "On-the-Fly" Recognition Tuning. For details, please see Advanced Development Tools.