tesseract ocr php free download

Showing 50 open source projects for "tesseract ocr php"

1

Tesseract OCR

Open Source OCR Engine

Tesseract is an open source optical character recognition (OCR) engine and command line program. OCR is a technology that recognizes text characters within a digital image. The latest version of Tesseract places a greater focus on line recognition, but it still supports the legacy Tesseract OCR engine, which recognizes character patterns.

5 Reviews

Downloads: 3,034 This Week

Last Update: 2025-12-26
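
Since Tesseract is described above as a command line program, a script in any language can use it by launching the executable. The following is a minimal sketch in Python, assuming the `tesseract` binary is installed and on the PATH. The function names and the file name `scan.png` are hypothetical, chosen only for illustration.

```python
import subprocess


def build_tesseract_command(image_path, lang="eng"):
    """Build the argument list for a basic Tesseract CLI call.

    Passing "stdout" as the output base asks Tesseract to print the
    recognized text instead of writing it to a .txt file.
    """
    return ["tesseract", image_path, "stdout", "-l", lang]


def ocr_image(image_path, lang="eng"):
    """Run Tesseract on one image and return the recognized text.

    Assumes the tesseract executable is installed and on the PATH.
    """
    result = subprocess.run(
        build_tesseract_command(image_path, lang),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if tesseract exits with an error
    )
    return result.stdout
```

Calling `ocr_image("scan.png")` would return the text Tesseract recognizes in that image. Building the command as a list rather than a shell string avoids quoting problems with file names that contain spaces.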
2

Tesseract.js

A pure JavaScript multilingual OCR

Tesseract.js is a pure JavaScript port of the popular Tesseract OCR engine. The library supports more than 100 languages, automatic text orientation and script detection, and a simple interface for reading paragraph, word, and character bounding boxes. Tesseract.js can run either in a browser or on a server with Node.js. Tesseract.js is a JavaScript library that gets words in almost any language out of images.

Downloads: 20 This Week

Last Update: 2025-12-15
3

LLM-Aided OCR Project

Enhances Tesseract OCR output using LLMs (local or API)

LLM Aided OCR is an open-source system designed to improve optical character recognition accuracy by combining traditional OCR tools with large language models. The project addresses common OCR challenges such as distorted text, unusual fonts, historical documents, and complex layouts that often produce inaccurate results with standard OCR pipelines. The system first extracts raw text using OCR engines and then applies language models to analyze and correct recognition errors based on...

Downloads: 3 This Week

Last Update: 5 days ago
4

Extractous

Fast and efficient unstructured data extraction

...For broader format support, the system combines its Rust core with ahead-of-time compiled Apache Tika shared libraries, which allows it to extend parsing coverage while still avoiding traditional server-based overhead. It also supports OCR for images and scanned documents through Tesseract, making it useful for document ingestion pipelines that include image-based or scanned inputs.

Downloads: 0 This Week

Last Update: 2026-03-06
5

VietOCR

Provides optical character recognition (OCR) solutions for the Vietnamese language.

24 Reviews

Downloads: 192 This Week

Last Update: 2026-01-17
6

Paperless-ngx

A community-supported supercharged version of paperless

Paperless-ngx is a community-supported open-source document management system that transforms your physical documents into a searchable online archive so you can keep, well, less paper.

Downloads: 15 This Week

Last Update: 6 days ago
7

gscan2pdf

A GUI to ease the process of producing a multipage PDF from a scan. gscan2pdf should work on almost any Linux/BSD machine.

22 Reviews

Downloads: 178 This Week

Last Update: 2025-11-05
8

AvantFAX

Multiuser HylaFAX PHP/MySQL Web interface for viewing faxes online, downloading & emailing in PDF format, and categorizing & archiving all sent and received faxes.

10 Reviews

Downloads: 2 This Week

Last Update: 2025-04-10
9

DocWire SDK

Award-winning modern data processing SDK in C++20

DocWire SDK, a standout C++20 AI-driven data processing tool, has received an award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...

Downloads: 3 This Week

Last Update: 7 hours ago
10

Screen Translate

An OCR translator tool made by utilizing tesseract & python-opencv

STL is an easy-to-use, lightweight OCR translator tool that can be used to translate your screen. Made with Python by utilizing Tesseract and opencv-python. For a full view of the project, you can check the GitHub repository: https://github.com/Dadangdut33/Screen-Translate REQUIREMENTS - Tesseract: https://github.com/UB-Mannheim/tesseract/wiki. Needed for the OCR.

3 Reviews

Downloads: 27 This Week

Last Update: 2023-02-08
11

Convert-Screenshot-To-Text

...If you only need to recognize English, please only select English." -No installation required. It's ready to use as soon as you open it.- I have made a major upgrade to CSTT this time, including support for all Tesseract-supported languages, improved OCR accuracy, added multiple recognition modes, added keyboard shortcuts for canvas movement and zooming, and enabled users to adjust OCR settings. If you like it, please support me. Author: A_A Email: A_A_kent_leung@hotmail.com Donation: (Buy Me a Coffee) https://www.buymeacoffee.com/AAkent (PATREON) patreon.com/A_A_KENT (PAYPAL) https://www.paypal.com/paypalme/AAKENT

1 Review

Downloads: 7 This Week

Last Update: 2023-04-05
12

gImageReader

A graphical frontend to tesseract-ocr

gImageReader is a simple Gtk/Qt front-end to tesseract. Features include:
- Import PDF documents and images from disk, scanning devices, clipboard and screenshots
- Process multiple images and documents in one go
- Manual or automatic recognition area definition
- Recognize to plain text or to hOCR documents
- Recognized text displayed directly next to the image
- Post-process the recognized text, including spellchecking
- Generate PDF documents from hOCR documents
**Note**:...

27 Reviews

Downloads: 152 This Week

Last Update: 2022-01-28
13

Screen Capture Image Text App Launcher

Run defined applications by detecting text in a captured screenshot

This application requires the Tesseract OCR engine to decode text in a captured screenshot. The resulting text file can be analysed for specific trigger words, each of which will run a defined application. Tesseract OCR is available for Windows users here: https://digi.bib.uni-mannheim.de/tesseract/ Information on its use is widely available; this Medium post provides an overview: https://medium.com/quantrium-tech/installing-and-using-tesseract-4-on-windows-10-4f7930313f82 During testing I used version 5 of the software. ...

Downloads: 0 This Week

Last Update: 2021-04-05
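
The trigger-word idea in the description above can be sketched in a few lines of Python. This is only an illustration of the general technique, not the project's actual implementation. The function names, the trigger table, and the whole-word, case-insensitive matching rule are all assumptions.

```python
import re
import subprocess


def find_triggered_commands(ocr_text, triggers):
    """Return the commands whose trigger word appears in the OCR text.

    `triggers` maps a trigger word to the command (an argument list) to run.
    Matching is case-insensitive and on whole words (an assumed rule).
    """
    words = set(re.findall(r"\w+", ocr_text.lower()))
    return [cmd for word, cmd in triggers.items() if word.lower() in words]


def launch(commands):
    """Start each matched command without waiting for it to finish."""
    for cmd in commands:
        subprocess.Popen(cmd)
```

For example, with a hypothetical table `{"invoice": ["notepad.exe"]}`, OCR text containing the word "Invoice" would select the `notepad.exe` command, which `launch` would then start.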
14

Linux-Intelligent-Ocr-Solution

Easy-OCR solution and Tesseract trainer for GNU/Linux

Linux-intelligent-ocr-solution (Lios) is free and open source software for converting print into text using either a scanner or a camera. It can also produce text from scanned images from other sources, such as a PDF, an image, a folder containing images, or a screenshot. The program is fully accessible to visually impaired users. A Tesseract Trainer GUI is also shipped with this package.

5 Reviews

Downloads: 8 This Week

Last Update: 2020-10-19
15

cuneiformplus

Fork of OCR software cuneiform

Fork of the OCR software cuneiform. For the original software, see https://launchpad.net/cuneiform-linux by Cognitive Technologies and Jussi Pakkanen. For other open source OCR software, see:
* Tesseract by Ray Smith (using the Leptonica image library)
* GOCR
* OCRAD

Downloads: 1 This Week

Last Update: 2020-12-08
16

OCR Image Simply

Simple Windows application to OCR images

Probably the simplest Windows application to OCR images using Tesseract 3.05.02. Languages recognized: German, English, French, Italian, Polish, Spanish. Just download the ZIP file, unzip the archive, and feel free to use it everywhere. Solution published under the MIT license. A description can be found at: https://coolautomations.com/ocr-as-simple-as-it-can-be/

Downloads: 0 This Week

Last Update: 2020-11-03
17

BL3-MayhemMod

Automatically Re-roll Mayhem Modifiers in Borderlands 3

An AutoIt script for automatically re-rolling the Mayhem 10 modifiers in Borderlands 3. Uses Tesseract-OCR for text recognition.

Downloads: 0 This Week

Last Update: 2020-10-25
18

neocr

Provides OCR solutions for Nepali, based on Tesseract 4.0.

NeOCR is free software based on Tesseract (Open Source OCR Engine) for the Windows operating system. It provides an easy, user-friendly interface to recognize text contained in images as well as PDF documents and convert it to editable text formats (.txt, .doc, .docx). This product is accessible to blind and visually impaired people (tested with NVDA and Narrator).

3 Reviews

Downloads: 14 This Week

Last Update: 2020-04-17
19

SwiftOCR

Fast and simple OCR library written in Swift

SwiftOCR is a fast and simple OCR library written in Swift. It uses a neural network for image recognition. As of now, SwiftOCR is optimized for recognizing short, one-line alphanumeric codes (e.g. DI4C9CM). We currently support iOS and OS X. If you want to recognize normal text like a poem or a news article, go with Tesseract, but if you want to recognize short, alphanumeric codes (e.g. gift cards), I would advise you to choose SwiftOCR because that's where it excels. ...

Downloads: 0 This Week

Last Update: 2023-05-29
20

OCR Template Creator

Create template for images or PDF files to be OCR'ed and databased

An OCR application which allows tag/value templates to be created with a web GUI for the automatic processing of PDFs or images of documents, receipts, contracts, etc. Provides an interface to Tesseract.

Downloads: 2 This Week

Last Update: 2018-12-15
21

pdfsandwich

pdfsandwich generates "sandwich" OCR PDF files: PDF files which contain only images (but no editable text) are processed by optical character recognition (OCR), and the recognized text is added to each page invisibly "behind" the images. pdfsandwich is a command line tool intended for OCR of scanned books or journals. It is able to recognize the page layout even for multicolumn text.

8 Reviews

Downloads: 374 This Week

Last Update: 2018-08-12
22

JATI - Just Another Tesseract Interface

Another interface for tesseract OCR to convert image to text.

Tesseract OCR is an open source, highly accurate image to text converter. However, Tesseract OCR provides only a command line interface. JATI is just another interface to the Tesseract OCR engine, providing a GUI to convert an image to text. It can do batch conversion, including converting only a portion of the image into text.

3 Reviews

Downloads: 3 This Week

Last Update: 2018-08-31
23

Tess4J

A Java JNA wrapper for Tesseract OCR API

9 Reviews

Downloads: 87 This Week

Last Update: 2018-05-26
24

FormRead

Free OMR - OCR web software based on JavaScript and PHP

https://formread.org FormRead is a completely free OMR (optical mark recognition) web software for scanning and grading user-filled, multiple choice forms. Create your formats with any of your office or drawing tools, scan them and parameterize their coordinates in an easy way. Once you have parameterized your form, you can print many copies, give them to your students/respondents, scan and recognize them with FormRead, and finally export the data in your preferred formats...

Downloads: 5 This Week

Last Update: 2022-03-04
25

cbrTekStraktor

An application to automatically extract text from comic books.

cbrTekStraktor is an application to automatically extract text from the text bubbles or speech balloons present in comic book reader files (CBR). Its prime goal is to perform analysis on the texts of comic books. cbrTekStraktor can, however, also be used for scanlation or similar purposes. The application also lets you manually define text areas in CBR files. The application comprises a simple graphical editor for further processing the extracted text. The text extraction is...

Downloads: 0 This Week

Last Update: 2017-06-14