Tesseract is an open source OCR or optical character recognition engine and command line program. OCR is a technology that allows for the recognition of text characters within a digital image. With the latest version of Tesseract, there is a greater focus on line recognition, however it still supports the legacy Tesseract OCR engine which recognizes character patterns.

Tesseract can recognize over 100 languages out-of-the-box, and can be trained to recognize other languages. It supports various output formats, including plain text, HTML, PDF and more. It also has unicode (UTF-8) support.

Features

  • OCR engine and command line program
  • Line recognition and character pattern recognition
  • Unicode (UTF-8) support
  • Recognizes more than 100 languages, and can be trained to recognize others
  • Supports various output formats

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow Tesseract OCR

Tesseract OCR Web Site

Other Useful Business Software
Try Google Cloud Risk-Free With $300 in Credit Icon
Try Google Cloud Risk-Free With $300 in Credit

No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
Start Free
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
5
0
0
0
0
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5

User Reviews

  • Enjoy this project for my mission
  • Brilliant. Worked properly first time. great code.
  • very good OCR project!
  • wow, good OCR. The release files are very oldest than http://code.google.com/p/tesseract-ocr/ I packed tesseract with gImageReader http://sourceforge.net/projects/gimagereader/
  • how to install in win Xp?
Read more reviews >

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

C++

Related Categories

C++ Image Recognition Software, C++ OCR Software

Registered

2020-05-04