Input File Formats
ABBYY Smart Classifier supports processing of the following document formats:
- RTF documents (*.rtf)
- Microsoft Word 97–2003 documents (*.doc)
- Microsoft Word documents (*.docx)
- Microsoft Word macro–enabled documents (*.docm)
- Microsoft Word XML documents (*.xml)
- Unformatted text files (*.txt) (We recommend saving text files in Unicode or UTF-8 with BOM)
- Web pages (*.html, *.htm)
- Microsoft PowerPoint 97–2003 presentations (*.ppt, *.pps)
- Microsoft PowerPoint presentations (*.pptx, *.ppsx)
- Microsoft PowerPoint macro–enabled presentations (*.pptm, *.ppsm)
- Microsoft PowerPoint XML presentations (*.xml)
- Microsoft Excel 97–2003 workbooks (*.xls)
- Microsoft Excel workbooks (*.xlsx)
- Microsoft Excel macro–enabled workbooks (*.xlsm)
- Adobe InDesign Markup Language (IDML) documents (*.idml)
- OpenDocument texts (*.odt)
- OpenDocument presentations (*.odp)
- OpenDocument spreadsheets (*.ods)
- Adobe FrameMaker files (*.mif)
- Adobe PDF documents (*.pdf) (license required)
- Image files (*.jpeg, *.jpg, *.bmp, *.gif, *.tif, *.tiff, *.png, *.djvu, *.dcx,
*.dib, *.jb2, *.jp2, *.j2k, *.jpf, *.jpx, *.pcx, *.wdp) (license required)
Important! Support for *.djvu will be discontinued in future versions of ABBYY
Smart Classifier. Please contact ABBYY if you need to process files in this format.