Tesseract Open Source OCR Engine
Tesseract is a free optical character recognition engine originally developed at Hewlett-Packard and currently developed by Google. It is a raw OCR engine - it has no document layout analysis, no output formatting, and no graphical user interface. It only processes a TIFF or BMP image of a single column and creates text from it. It can detect fixed pitch vs proportional text. The engine was in the top 3 in terms of character accuracy in 1995. The source code will read a binary, grey or color image and output text.
Tesseract can process English, French, Italian, German, Spanish, Brazilian, Portuguese and Dutch and can be trained to work in other languages as well.
- Sources inherited from project openSUSE:Backports:SLE-15-SP5:Update
- Download package
-
Checkout Package
osc -A https://api.opensuse.org checkout openSUSE:Leap:15.5:Update/tesseract-ocr.17951 && cd $_
- Create Badge
Refresh
Refresh
Source Files
Filename | Size | Changed |
---|---|---|
baselibs.conf | 0000000014 14 Bytes | |
tesseract-5.3.1.tar.gz | 0001916779 1.83 MB | |
tesseract-ocr.changes | 0000013803 13.5 KB | |
tesseract-ocr.spec | 0000003928 3.84 KB |
Latest Revision
Maintenance Automation (maintenance-robot)
accepted
request 1092506
from
Maintenance Automation (maintenance-robot)
(revision 1)
Release from openSUSE:Maintenance:17951 / tesseract-ocr.openSUSE_Backports_SLE-15-SP5_Update
Comments 0