High-quality mobile recognition
The ABBYY Mobile OCR Engine is based on ABBYY’s world-renowned OCR technology – providing highly accurate text recognition from mobile devices. Its sophisticated functionality includes:
- Automatic image skew correction. Photographs taken with a mobile device often suffer from image skew, which reduces recognition quality. The Engine detects and corrects skew with a precision of one degree (the maximum detectable skew angle is 16 degrees), significantly improving the quality and accuracy of mobile OCR.
- Document orientation detection function. Image pre-processing automatically detects the orientation of a page.
- Hyphenation support. If the engine encounters part of a hyphenated word on one line and the second part on the next line, it will join them into one word.
- Confidence level indicator. This function shows the level of certainty for recognized text, allowing developers to set flexible criteria for implementation of proofreading and verification functions.
- Improved data analysis algorithm. The Engine’s image-processing algorithm enables data analysis that discards all unnecessary information in an image, boosting recognition accuracy.
- Spell checking during text recognition considerably improves the quality of output.
- Faster binarization stage. A new binarization algorithm reduces the processing time of small documents in European languages (for example, business cards) by up to 10-15%. Binarization itself is seven to eight times faster.
- Zonal OCR, which enables applications to recognize text blocks that have been manually set up on an image.
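The hyphenation support and confidence level indicator described above can be illustrated with a short sketch. This is not the Engine's API: the Word class, the 0.0 to 1.0 confidence scale, and the 0.8 threshold are assumptions made only for this example.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """Hypothetical recognized word; not an ABBYY type."""
    text: str
    confidence: float  # assumed scale: 0.0 (uncertain) to 1.0 (certain)


def join_hyphenated(lines):
    """Join a word split by a trailing hyphen with the start of the next line.

    Naive on purpose: a genuine compound broken at its hyphen
    (for example "well-" / "known") would also lose the hyphen.
    """
    words = []
    carry = ""  # first half of a word waiting for its second half
    for line in lines:
        tokens = line.split()
        if not tokens:
            continue
        if carry:
            tokens[0] = carry + tokens[0]
            carry = ""
        if tokens[-1].endswith("-") and len(tokens[-1]) > 1:
            carry = tokens.pop()[:-1]
        words.extend(tokens)
    if carry:
        # No following line: keep the fragment with its hyphen restored.
        words.append(carry + "-")
    return words


def needs_review(words, threshold=0.8):
    """Return the text of words whose confidence falls below the threshold."""
    return [w.text for w in words if w.confidence < threshold]
```

An application could send the words returned by needs_review to a proofreading screen and accept the rest automatically, tuning the threshold to how strict verification needs to be.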
Low resource requirements
The ABBYY Mobile OCR Engine is based on compact code OCR technology and is optimized to work with devices that have small memory sizes – including smartphones, tablets and portable scanners. Features include:
- Upgraded memory management. A new memory management algorithm enables the software to determine the exact amount of memory required to process an image. This eliminates the need to allocate large memory segments in advance, which can impair recognition speed and reliability.
- Compact code. The Engine’s code occupies as little as 8 MB of ROM and 10 MB of RAM, depending on desired functionality.
- Parallel recognition. Recognition operations are performed in parallel. Four threads are used by default, and a custom number of threads can be set. Speed enhancements are most noticeable for documents containing many lines of text.
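As a rough illustration of recognition running on a configurable number of threads, the sketch below spreads per-line work across a thread pool. The function names and the placeholder recognition step are hypothetical; only the default of four threads comes from the description above.

```python
from concurrent.futures import ThreadPoolExecutor

DEFAULT_THREADS = 4  # the default thread count stated in the text


def recognize_line(line_image):
    """Placeholder for per-line recognition; here it only upper-cases a string."""
    return line_image.upper()


def recognize_lines(line_images, threads=DEFAULT_THREADS):
    """Process each line on a worker thread and return results in input order."""
    if threads < 1:
        raise ValueError("threads must be at least 1")
    with ThreadPoolExecutor(max_workers=threads) as pool:
        # map() preserves input order, so text lines come back in reading order.
        return list(pool.map(recognize_line, line_images))
```

Because the work is divided line by line, a document with only a few lines leaves most workers idle, which matches the note that gains are largest on documents with many lines of text.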
Business card recognition
The Engine also processes business card images obtained via mobile device cameras or portable scanners. This allows retrieval of information including first name, last name, title, phone number, e-mail, address, etc. Plus, the technology can recognize cards in 26 languages.
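To give a sense of what retrieving fields from a recognized card involves, here is a minimal sketch that pulls an e-mail address and a phone number out of plain recognized text. The regular expressions and the function name are assumptions for illustration only; they are far simpler than a real business card parser and do not reflect how the Engine works internally.

```python
import re

# Deliberately simple patterns, chosen for this example only.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\+?\d[\d ().-]{6,}\d")


def extract_contact_fields(card_text):
    """Return the first e-mail address and phone number found, or None for each."""
    email = EMAIL_RE.search(card_text)
    phone = PHONE_RE.search(card_text)
    return {
        "email": email.group(0) if email else None,
        "phone": phone.group(0) if phone else None,
    }
```

Fields such as name, title, and address have no fixed pattern and need layout and dictionary cues, which is why they are left out of this sketch.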
Recognition of the most popular 1D and 2D barcodes is supported, and several barcodes can be detected in a single image. See The Full List of Supported Barcodes.
Sixty-two recognition languages
Text recognition is supported for 62 languages.
Innovative data analysis algorithm
The data analysis algorithm of the Engine enables recognition and retention of the source document’s original formatting in the output text:
- Preserves multi-column text. The Engine’s Paragraph Assembly function identifies text-block borders and recognizes each block separately, preserving the format of multi-column text, paragraphs, and text segments.
- Preserves character fonts. The Engine identifies the font properties of source text – bold, italic or underlined.
Two recognition modes
One of two mobile recognition modes can be selected:
- Fast mode. When an image is of good quality, this mode cuts the time required for recognition and processing.
- Full mode. Best for low-quality images, when more time is required to achieve optimal results.
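An application could choose between the two modes automatically. The sketch below is one possible heuristic, not part of the Engine: it treats the standard deviation of grayscale pixel values as a stand-in for image quality, and the RecognitionMode enum, the choose_mode function, and the threshold of 40 are all assumptions.

```python
from enum import Enum
from statistics import pstdev


class RecognitionMode(Enum):
    """Hypothetical names for the two modes described in the text."""
    FAST = "fast"
    FULL = "full"


def choose_mode(gray_pixels, min_contrast=40.0):
    """Pick FAST for high-contrast images and FULL otherwise.

    gray_pixels: non-empty sequence of grayscale values in the range 0-255.
    min_contrast: assumed threshold on the standard deviation of the values.
    """
    contrast = pstdev(gray_pixels)
    if contrast >= min_contrast:
        return RecognitionMode.FAST
    return RecognitionMode.FULL
```

Contrast alone misses blur and uneven lighting, so a real application would likely combine several quality signals or let the user override the choice.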