Tesseract Open Source OCR Engine

Tesseract is a free optical character recognition (OCR) engine, originally developed at Hewlett-Packard and now developed by Google. It is a raw OCR engine: it has no document layout analysis, no output formatting, and no graphical user interface. It processes only a TIFF or BMP image of a single column and produces text from it. It can distinguish fixed-pitch from proportional text. In 1995 the engine ranked among the top 3 for character accuracy. It reads a binary, grey or color image and outputs text.

Tesseract can process English, French, Italian, German, Spanish, Brazilian Portuguese and Dutch, and can be trained to work with other languages as well.
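
As a rough illustration of the image-in, text-out workflow described above, the sketch below wraps the usual `tesseract IMAGE OUTPUTBASE -l LANG` command line in a small shell function. The names `ocr_image`, `DRY_RUN`, `page.tif` and `page` are hypothetical and chosen only for this example; it also assumes the language data for the requested code (for example `eng`, `fra` or `deu`) is installed.

```shell
# Sketch only: wraps the command line "tesseract IMAGE OUTPUTBASE -l LANG".
# ocr_image, DRY_RUN, page.tif and page are hypothetical names for illustration.
ocr_image() {
    image=$1          # input image, e.g. a single-column TIFF
    outbase=$2        # output base name; tesseract writes OUTPUTBASE.txt
    lang=${3:-eng}    # language code; defaults to English
    if [ -n "${DRY_RUN:-}" ]; then
        # Print the command instead of running it.
        printf 'tesseract %s %s -l %s\n' "$image" "$outbase" "$lang"
        return 0
    fi
    tesseract "$image" "$outbase" -l "$lang"
}

# Example: show what would be run for a German page.
DRY_RUN=1 ocr_image page.tif page deu
```

Without `DRY_RUN` set, the function calls `tesseract` directly, so the recognized text should end up in `page.txt` next to the chosen output base name.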

Developed at Publishing
Sources inherited from project openSUSE:Factory
1 derived package
Checkout Package
osc -A https://api.opensuse.org checkout home:seife:Factory/tesseract-ocr && cd $_

Source Files

Filename	Size	Changed
baselibs.conf	14 Bytes	over 1 year ago
tesseract-5.3.1.tar.gz	1.83 MB	over 1 year ago
tesseract-ocr.changes	13.5 KB	over 1 year ago
tesseract-ocr.spec	3.84 KB	over 1 year ago

Revision 14 (latest revision is 18)

Dominique Leuenberger (dimstar_suse) accepted request 1091719 from Ondřej Súkup (mimi_vx) over 1 year ago (revision 14)

- update to 5.3.1
- revert back to autoconf build as upstream doesn't support CMake
   outside Windows
  * Bugfixes for special case scenarios (forwarded request 1091718 from mimi_vx)
