Tesseract OCR

GPTKB entity

Statements (33)
Predicate Object
gptkbp:instanceOf optical character recognition software
gptkbp:acquiredBy gptkb:Google
gptkbp:category gptkb:graphical_user_interface
gptkb:software
OCR engine
gptkbp:contributedTo gptkb:software
gptkbp:developedBy gptkb:Hewlett-Packard_Labs
gptkbp:developer gptkb:Google
gptkb:Hewlett-Packard
gptkbp:firstReleased 2005
https://www.w3.org/2000/01/rdf-schema#label Tesseract OCR
gptkbp:latestReleaseVersion 5.3.3
2023-06-01
gptkbp:license gptkb:Apache_License_2.0
gptkbp:operatingSystem gptkb:Windows
gptkb:macOS
gptkb:Linux
gptkbp:programmingLanguage gptkb:C++
gptkbp:repository https://github.com/tesseract-ocr/tesseract
gptkbp:sourceModel open source
gptkbp:supports gptkb:Unicode
PDF output
image preprocessing
custom OCR models
right-to-left languages
training for new languages
gptkbp:supportsLanguage over 100 languages
gptkbp:usedFor automated data entry
document digitization
text extraction from images
gptkbp:website https://tesseract-ocr.github.io/
gptkbp:bfsParent gptkb:hOCR
gptkbp:bfsLayer 7