ABBYY Artificial Intelligence
Purpose-Built AI Center

At the heart of ABBYY solutions is a combination of technologies that delivers best-in-class intelligent document processing (IDP).
Innovative AI is built into every step of ABBYY’s IDP pipeline, from image enhancement to object detection, OCR/ICR, classification, extraction from semi-structured documents, and extraction from unstructured documents.
Using the right combination of technologies and techniques, ABBYY IDP solutions can process any kind of document: any format, any language, any structure. All our specialized techniques have been optimized to deliver the best possible inferences with the fewest resources, keeping costs low and delivering the greatest ROI for our customers.
Cutting-edge AI tools powering ABBYY’s purpose-built solutions
A combination of AI models and algorithms, each highly optimized for its task.
OCR & ICR – optical character recognition and handwriting recognition
ABBYY is a pioneer in optical character recognition technology, actively researching and innovating in this area since 1993, when our first “omnifont OCR system” ABBYY FineReader was launched to the market. Over the years, the technology has evolved from recognizing individual characters, identifying words, and reproducing page structure, to applying adaptive document recognition technology (ADRT®) that understands documents in their entirety, including layout, multi-page structure, and elements such as header, footer, and table of contents.
With advances in AI, ABBYY has developed and solidified its end-to-end approach to OCR and ICR over the last several years. This approach uses the same technologies that underpin generative AI tools: convolutional neural networks, transformers, and language models.
The convolutional neural network (CNN) breaks an image of handwritten or printed text into low-level visual features to work out what the image actually shows. The CNN output then goes into a transformer, which proposes candidate words. Then we introduce our very own language model (LM), which has billions of parameters and a specific function: taking the context of all the words in a group and using it to reach a conclusion. This technique drastically improves the overall performance and accuracy of our OCR and ICR capabilities, and it is used in combination with our statistical approach. Our AI automatically decides which approach best fits your document use cases, optimizing on the fly for consistency, accuracy, and speed, which leads to better straight-through processing rates.
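To make the role of the language model concrete, here is a minimal Python sketch of context-based rescoring. It is an illustration under stated assumptions, not ABBYY's implementation: the candidate words, their visual probabilities, the toy bigram table, and the names `candidates`, `BIGRAMS`, `lm_log_prob`, and `rescore` are all hypothetical. The visual stage (standing in for the CNN and transformer) proposes several readings per word, and a small language model picks the sequence that makes the most sense in context.

```python
import math
from itertools import product

# Hypothetical output of the visual stage (CNN + transformer):
# for each word position, a list of (candidate_word, visual_probability).
candidates = [
    [("pay", 0.55), ("bay", 0.45)],
    [("the", 0.90), ("tho", 0.10)],
    [("invoice", 0.40), ("involce", 0.60)],
]

# Toy bigram language model: P(word | previous word). Values are made up.
BIGRAMS = {
    ("<s>", "pay"): 0.6, ("<s>", "bay"): 0.1,
    ("pay", "the"): 0.5, ("bay", "the"): 0.1,
    ("the", "invoice"): 0.3,
}
UNSEEN = 1e-4  # floor probability for word pairs missing from the table


def lm_log_prob(words):
    """Log-probability of a word sequence under the toy bigram model."""
    prev, total = "<s>", 0.0
    for word in words:
        total += math.log(BIGRAMS.get((prev, word), UNSEEN))
        prev = word
    return total


def rescore(candidates, lm_weight=1.0):
    """Return the word sequence with the best combined visual + LM score."""
    best_words, best_score = None, -math.inf
    # Exhaustive search is fine for a toy example; real systems use beam search.
    for combo in product(*candidates):
        words = [word for word, _ in combo]
        visual = sum(math.log(prob) for _, prob in combo)
        score = visual + lm_weight * lm_log_prob(words)
        if score > best_score:
            best_words, best_score = words, score
    return best_words


# Picking the top visual candidate at each position, ignoring context.
visual_only = [max(position, key=lambda c: c[1])[0] for position in candidates]

print(visual_only)          # ['pay', 'the', 'involce']
print(rescore(candidates))  # ['pay', 'the', 'invoice']
```

On its own, the visual stage prefers the misreading "involce" because it looks slightly more likely in isolation. Once the surrounding words are taken into account, the combined score favors "invoice", which is the kind of context-driven correction described above.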