Comparison of optical character recognition software

From Wikipedia, the free encyclopedia

This comparison of optical character recognition software includes:

  • OCR engines, which perform the actual character identification
  • Layout analysis software, which divides scanned documents into zones suitable for OCR
  • Graphical interfaces to one or more OCR engines
  • Software development kits that are used to add OCR capabilities to other software (e.g. forms processing applications, document imaging management systems, e-discovery systems, records management solutions)
| Name | Founded year | Latest stable version | Release year | License | Online | Windows | Mac OS X | Linux | BSD | Android | iOS | Programming language | SDK? | Languages | Fonts | Output formats | Notes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ABBYY FineReader | 1989 | 16 | 2022 | Proprietary | Yes | Yes | Yes | No | Yes | Yes | Yes | C/C++ | Yes | 192[1] | All fonts | DOC, DOCX, XLS, XLSX, PPTX, RTF, PDF, HTML, CSV, TXT, ODT, DjVu, EPUB, FB2[2] | ABBYY also supplies SDKs for embedded and mobile devices. Professional, Corporate and Site License Editions for Windows, Express Edition for Mac.[3] |
| AnyDoc Software | 1989 | ? | ? | Proprietary | No | Yes | No | No | No | ? | ? | VBScript | ? | ? | ? | | Works with structured, semi-structured, and unstructured documents. |
| Asprise OCR SDK | 1998 | 15 | 2015 | Proprietary | Yes | Yes | Yes | Yes | Yes | ? | ? | Java, C#, VB.NET, C/C++/Delphi | Yes | 20+[4] | ? | Plain text, searchable PDF, XML[5] | Java, C#, VB.NET, C/C++/Delphi SDKs for OCR and barcode recognition on Windows, Linux, Mac OS X and Unix.[6] |
| CuneiForm | 1996 | 1.1 | 2011 | BSD variant | No | Yes | Yes | Yes | Yes | ? | ? | C/C++ | Yes | 28 | Any printed font | HTML, hOCR, native, RTF, TeX, TXT[7] | Enterprise-class system; can save text formatting and recognizes complicated tables of any structure |
| E-aksharayan | 2010 | | | | | Yes | No | Yes | No | ? | ? | | | 14 | | RTF, TXT, BRL | |
| GOCR | 2000 | 0.52[8] | 2018 | GPL | Yes[9] | Yes | Yes | Yes | Yes | ? | ? | C | ? | 20+ | ? | | |
| Google Drive OCR or Google Cloud Vision | 2015 | | | Proprietary | Yes | Browser | Browser | Browser | Unknown | ? | ? | Unknown | Yes | 200+ | All fonts | Text | Google blog post[10][11] |
| Microsoft Office Document Imaging | ? | Office 2007 | 2007 | Proprietary | No | Yes | No | No | No | ? | ? | ? | ? | ? | ? | | Uses OmniPage[citation needed] |
| Microsoft Office OneNote 2007 | 2011 | ? | 2007 | Proprietary | No | Yes | No | No | No | ? | ? | ? | ? | ? | ? | | |
| OCRFeeder | 2009-03 | 0.8.5 | 2022 | GPL | No | No | No | Yes | No | ? | ? | Python | ? | ? | ? | | Features a full user interface and a command-line tool for automatic operations. Has its own segmentation algorithm but uses system-wide OCR engines such as Tesseract or Ocrad |
| Ocrad | ? | 0.28[12] | 2022 | GPL | Yes | No | Yes | Yes | Yes | ? | ? | C++ | Yes | Latin alphabet | ? | | Command line |
| OCRopus | 2007 | 1.3.3 | 2017 | Apache | No | No | Yes | Yes | Yes | ? | ? | Python | ? | All languages using Latin script (other languages can be trained) | Normal Latin script and Fraktur (other scripts can be trained) | TXT, hOCR,[13] PDF[14] | Pluggable framework under active development, used for Google Books |
| OmniPage | 1970s | 19.2 | 2015 | Proprietary | Yes | Yes | Yes | Yes | No | ? | ? | C/C++, C#[15] | Yes | 125[16] | Machine and handprinted fonts | DOC/DOCX, XLS/XLSX, PPTX, RTF, PDF, PDF/A, Searchable PDF, HTML, Text, XML, ePUB, MP3 | Product of Nuance Communications |
| Puma.NET | ? | ? | 2009 | BSD | No | Yes | No | No | No | ? | ? | C# | Yes | 28 | Any printed font | | .NET OCR SDK based on Cognitive Technologies' CuneiForm recognition engine. Wraps Puma COM server and provides simplified API for .NET applications |
| ReadSoft | ? | ? | ? | Proprietary | No | Yes | No | No | No | ? | ? | ? | ? | ? | ? | | Scan, capture and classify business documents such as invoices, forms and purchase orders integrated with business processes. |
| Scantron | ? | ? | ? | Proprietary | No | Yes | No | No | No | ? | ? | ? | ? | ? | ? | | For working with localized interfaces, corresponding language support is required. |
| SmartScore | 1991 | 10.5.8 | 2015 | Proprietary | No | Yes | Yes | No | No | ? | ? | ? | ? | ? | ? | | For musical scores |
| Tesseract | 1985 | 5.3.3 | 2023 | Apache | No | Yes | Yes | Yes | Yes | ? | ? | C++, C | Yes | 100+[17] | Any printed font | Text, ALTO, hOCR,[18] PDF, others with different user interfaces[19] or the API | Created by Hewlett-Packard; under further development by Google[20] |

Evaluation

A 2016 analysis compared the accuracy and reliability of four OCR packages (Google Docs OCR, Tesseract, ABBYY FineReader, and Transym) on a dataset of 1,227 images from 15 categories. It concluded that Google Docs OCR and ABBYY FineReader performed better than the others.[21]
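
The scoring method used in that study is not described here. As an illustration only, one common way to compare OCR engines against a known ground-truth transcription is the character error rate (edit distance divided by reference length). The sketch below assumes that metric; the engine names and sample strings are hypothetical and are not taken from the cited evaluation.

```python
def edit_distance(reference: str, hypothesis: str) -> int:
    """Levenshtein distance: minimum insertions, deletions, substitutions."""
    # previous[j] holds the distance between the processed prefix of
    # `reference` and the first j characters of `hypothesis`.
    previous = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        current = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            cost = 0 if ref_char == hyp_char else 1
            current.append(min(
                previous[j] + 1,         # delete ref_char
                current[j - 1] + 1,      # insert hyp_char
                previous[j - 1] + cost,  # substitute or match
            ))
        previous = current
    return previous[-1]


def character_error_rate(reference: str, hypothesis: str) -> float:
    """Edit distance normalized by the length of the reference text."""
    if not reference:
        raise ValueError("reference text must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


# Hypothetical ground truth and engine outputs, invented for illustration.
ground_truth = "optical character recognition"
outputs = {
    "engine_a": "optical character recognition",
    "engine_b": "optica1 charactcr recogniti0n",
}

for name, text in outputs.items():
    print(name, round(character_error_rate(ground_truth, text), 3))
```

A lower rate means the engine's output is closer to the reference. A real evaluation would average the metric over many images and categories rather than a single string.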

References

  1. "ABBYY FineReader 14: Technical Specifications". Finereader.abbyy.com. Retrieved 2017-02-23.
  2. "ABBYY FineReader 11: Technical Specifications". Finereader.abbyy.com. Retrieved 2013-09-12.
  3. "Top OCR Software". Ocrworld.com. 2010-03-30. Archived from the original on 2017-02-23. Retrieved 2013-09-12.
  4. "Asprise OCR SDK Features". asprise.com. Retrieved 2014-06-21.
  5. "Asprise Java OCR Library Features". asprise.com. Retrieved 2014-06-21.
  6. "Asprise Java, C#/VB.NET OCR API". asprise.com. 2015-11-19. Retrieved 2015-11-19.
  7. Debian manual page for Cuneiform for Linux version 1.1.0.
  8. "GOCR Homepage". wasd.urz.uni-magdeburg.de. Retrieved 2018-10-17.
  9. "GOCR". Jocr.sourceforge.net. Retrieved 2013-09-12.
  10. "Supported languages". 2022-02-11.
  11. Ashok Popat (2015-09-04). "IEEE SPS: Optical Character Recognition for Most of the World's Languages". YouTube. Archived from the original on 2021-12-20.
  12. Diaz, Antonio (2022-01-17). "GNU Ocrad 0.28 released" (Mailing list). info-gnu.
  13. OCRopus includes the ocropus-hocr tool, which produces hOCR from the recognition results.
  14. In combination with the hocr-tools.
  15. "OmniPage CSDK - OCR Document Capture Toolkit | Document Imaging & OCR". Nuance. Archived from the original on 2010-08-24. Retrieved 2013-09-12.
  16. "OmniPage Standard Document Conversion". Nuance. Archived from the original on 2014-03-13. Retrieved 2014-02-25.
  17. Based on count of language training files for version 3.04. Available at the download page.
  18. Usage explained in the Tesseract Readme and FAQ.
  19. Such as ODF with OCRFeeder.
  20. "GitHub - tesseract-ocr/tesseract: Tesseract Open Source OCR Engine (main repository)". GitHub. Retrieved 2018-11-05.
  21. Assefi, Mehdi (2016-12-01). "OCR as a Service: An Experimental Evaluation of Google Docs OCR, Tesseract, ABBYY FineReader, and Transym". ResearchGate. Retrieved 2019-01-31.