Showing 48 open source projects for "tesseract"

  • 1
    Tesseract OCR

    Open Source OCR Engine

    Tesseract is an open source optical character recognition (OCR) engine and command-line program. OCR is a technology that recognizes text characters within a digital image. The latest version of Tesseract places greater focus on line recognition, but it still supports the legacy Tesseract OCR engine, which recognizes character patterns.
    Downloads: 3,034 This Week
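Since Tesseract is described above as a command-line program with both a line-recognition engine and a legacy engine, a minimal sketch of driving it from Python may help. It assumes Tesseract 4 or later is installed and on the PATH, where `-l` selects the language and `--oem` selects the engine mode (0 for legacy only, 1 for the LSTM line recognizer only). The function names `build_tesseract_command` and `run_ocr` are hypothetical helpers, not part of Tesseract.

```python
import shutil
import subprocess
from typing import List, Optional


def build_tesseract_command(image_path: str, output_base: str = "stdout",
                            lang: str = "eng",
                            oem: Optional[int] = None) -> List[str]:
    """Assemble the argument list for the `tesseract` command-line program.

    An output base of "stdout" asks tesseract to print the recognized text
    instead of writing <output_base>.txt.
    """
    cmd = ["tesseract", image_path, output_base, "-l", lang]
    if oem is not None:
        # Assumption: Tesseract 4+ engine modes (0 = legacy, 1 = LSTM).
        cmd += ["--oem", str(oem)]
    return cmd


def run_ocr(image_path: str, lang: str = "eng") -> Optional[str]:
    """Return the recognized text, or None if tesseract is not installed."""
    if shutil.which("tesseract") is None:
        return None
    result = subprocess.run(build_tesseract_command(image_path, "stdout", lang),
                            capture_output=True, text=True, check=True)
    return result.stdout
```

Calling `run_ocr("scan.png")` would return the text of a hypothetical `scan.png`. Selecting the legacy engine with `oem=0` generally requires language data that still includes the legacy model.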
  • 2
    Tesseract.js

    A pure JavaScript Multilingual OCR

    Tesseract.js is a pure JavaScript port of the popular Tesseract OCR engine. The library supports more than 100 languages, automatic text orientation and script detection, and a simple interface for reading paragraph, word, and character bounding boxes. Tesseract.js can run either in a browser or on a server with Node.js. It extracts words in almost any language from images.
    Downloads: 20 This Week
  • 3
    Paperless-ngx

    A community-supported supercharged version of paperless

    Paperless-ngx is a community-supported open-source document management system that transforms your physical documents into a searchable online archive so you can keep, well, less paper.
    Downloads: 15 This Week
  • 4
    JavaCV

    Java interface to OpenCV, FFmpeg, and more

    JavaCV uses wrappers from the JavaCPP Presets of libraries commonly used by researchers in the field of computer vision (OpenCV, FFmpeg, libdc1394, FlyCapture, Spinnaker, OpenKinect, librealsense, CL PS3 Eye Driver, videoInput, ARToolKitPlus, flandmark, Leptonica, and Tesseract) and provides utility classes to make their functionality easier to use on the Java platform, including Android. JavaCV also comes with hardware-accelerated full-screen image display (CanvasFrame and GLCanvasFrame), easy-to-use methods to execute code in parallel on multiple cores (Parallel), user-friendly geometric and color calibration of cameras and projectors (GeometricCalibrator, ProCamGeometricCalibrator, ProCamColorCalibrator), detection and matching of feature points (ObjectFinder), a set of classes that implement direct image alignment of projector-camera systems (mainly GNImageAligner, ProjectiveTransformer, ProjectiveColorTransformer, ProCamTransformer, and ReflectanceInitializer), and more.
    Downloads: 16 This Week
  • 5
    LLM-Aided OCR Project

    Enhances Tesseract OCR output using LLMs (local or API)

    LLM Aided OCR is an open-source system designed to improve optical character recognition accuracy by combining traditional OCR tools with large language models. The project addresses common OCR challenges such as distorted text, unusual fonts, historical documents, and complex layouts that often produce inaccurate results with standard OCR pipelines. The system first extracts raw text using OCR engines and then applies language models to analyze and correct recognition errors based on...
    Downloads: 3 This Week
  • 6
    Extractous

    Fast and efficient unstructured data extraction

    ...For broader format support, the system combines its Rust core with ahead-of-time compiled Apache Tika shared libraries, which allows it to extend parsing coverage while still avoiding traditional server-based overhead. It also supports OCR for images and scanned documents through Tesseract, making it useful for document ingestion pipelines that include image-based or scanned inputs.
    Downloads: 0 This Week
  • 7
    Provides optical character recognition (OCR) solutions for Vietnamese language.
    Downloads: 192 This Week
  • 8
    LabMACOSX
    ...Packetstorm is all packet storm exploits database all inserted inside in the application Packetstorm, Shoppingosx an example of https connection. The application Ob open four browsers, Tesseract applications container, TAL, Chatosx, AndroMac, AndroEmu, MySMS, Shutdown, Updater, Search for, MyMaps, SOA, BLI, Xcoder, BasicProg, BasicProg2, VM, D,M&Z Suite Basic and D,M&Z Suite Advanced (POSIX based).
    Downloads: 4 This Week
  • 9
    A GUI to ease the process of producing a multipage PDF from a scan. gscan2pdf should work on almost any Linux/BSD machine.
    Downloads: 178 This Week
  • 10
    Multiuser HylaFAX PHP/MySQL Web interface for viewing faxes online, downloading & emailing in PDF format, and categorizing & archiving all sent and received faxes.
    Downloads: 2 This Week
  • 11
    Screen Translate

    An OCR translator tool made by utilizing tesseract & python-opencv

    STL is an easy-to-use, lightweight OCR translator tool that can be used to translate your screen. It is made with Python using Tesseract and opencv-python. For a full view of the project, see the GitHub repository: https://github.com/Dadangdut33/Screen-Translate Requirements: - Tesseract (https://github.com/UB-Mannheim/tesseract/wiki), needed for the OCR; install it with all the language packs. - LibreTranslate (optional, for offline translation support). - An internet connection for translation if not using LibreTranslate. Setup tutorial: https://github.com/Dadangdut33/Screen-Translate#installation-and-setup
    Downloads: 27 This Week
  • 12
    Convert-Screenshot-To-Text
    ...If you only need to recognize English, please only select English." -No installation required. It's ready to use as soon as you open it.- I have made a major upgrade to CSTT this time, including support for all Tesseract-supported languages, improved OCR accuracy, added multiple recognition modes, added keyboard shortcuts for canvas movement and zooming, and enabled users to adjust OCR settings. If you like it, please support me. Author: A_A Email: A_A_kent_leung@hotmail.com Donation: (Buy Me a Coffee) https://www.buymeacoffee.com/AAkent (PATREON) patreon.com/A_A_KENT (PAYPAL) https://www.paypal.com/paypalme/AAKENT
    Downloads: 7 This Week
  • 13
    gImageReader

    A graphical frontend to tesseract-ocr

    gImageReader is a simple Gtk/Qt front-end to tesseract. Features include: - Import PDF documents and images from disk, scanning devices, clipboard and screenshots - Process multiple images and documents in one go - Manual or automatic recognition area definition - Recognize to plain text or to hOCR documents - Recognized text displayed directly next to the image - Post-process the recognized text, including spellchecking - Generate PDF documents from hOCR documents **Note**: This page is only a mirror for the downloads. ...
    Downloads: 152 This Week
  • 14

    Screen Capture Image Text App Launcher

    Run defined applications by detecting text in a captured screenshot

    This application requires the Tesseract OCR engine to decode text in a captured screenshot. The resulting text file can be analysed for specific trigger words, each of which runs a defined application. Tesseract OCR is available for Windows users here: https://digi.bib.uni-mannheim.de/tesseract/ Information on its use is widely available; this Medium post provides an overview: https://medium.com/quantrium-tech/installing-and-using-tesseract-4-on-windows-10-4f7930313f82 During testing I used version 5 of the software. ...
    Downloads: 0 This Week
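The trigger-word idea in the description above can be sketched roughly as follows. This is not the project's own code: the function `find_trigger`, the trigger table, and the example commands are all hypothetical, and the sketch assumes the OCR text has already been read into a string.

```python
import re
from typing import Dict, List, Optional


def find_trigger(ocr_text: str,
                 triggers: Dict[str, List[str]]) -> Optional[List[str]]:
    """Return the command mapped to the first trigger word found in the text.

    Matching is case-insensitive and on whole words, so "invoice" does not
    fire on "invoices". Triggers are checked in the table's insertion order.
    """
    words = set(re.findall(r"[a-z0-9]+", ocr_text.lower()))
    for trigger, command in triggers.items():
        if trigger.lower() in words:
            return command
    return None


# Hypothetical trigger table: word -> command to launch.
TRIGGERS = {
    "invoice": ["notepad.exe", "invoices.txt"],
    "error": ["eventvwr.exe"],
}

command = find_trigger("Total due on INVOICE 1042", TRIGGERS)
# A launcher would then hand `command` to subprocess.Popen(command).
```

Whole-word matching is a deliberate choice here because OCR output often contains fragments that would otherwise produce false substring matches.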
  • 15
    Tesseract Animator

    This program animates a shaded tesseract

    This program animates a shaded tesseract in a GTK window.
    Downloads: 0 This Week
  • 16
    Linux-Intelligent-Ocr-Solution

    Easy-OCR solution and Tesseract trainer for GNU/Linux

    Linux-intelligent-ocr-solution (Lios) is free and open source software for converting print into text using either a scanner or a camera. It can also produce text from scanned images from other sources, such as PDFs, images, folders containing images, or screenshots. The program is fully accessible to visually impaired users. A Tesseract Trainer GUI is also shipped with this package. Forum: https://groups.google.com/forum/#!forum/lios Video tutorial: https://www.youtube.com/playlist?list=PLn29o8rxtRe1zS1r2-yGm1DNMOZCgdU0i Tesseract training tutorial (beta): https://www.youtube.com/watch?v=qLpCld4cdtk Source code: GitHub: https://github.com/Nalin-x-Linux/lios-3 GitLab: https://gitlab.com/Nalin-x-Linux/lios-3 A user guide is available on the download page.
    Downloads: 8 This Week
  • 17

    cuneiformplus

    Fork of OCR software cuneiform

    Fork of the OCR software Cuneiform. For the original software by Cognitive Technologies and Jussi Pakkanen, see https://launchpad.net/cuneiform-linux. Other open source OCR software: * Tesseract by Ray Smith (using the Leptonica image library) * GOCR * OCRAD
    Downloads: 1 This Week
  • 18
    OCR Image Simply

    Simple Windows application to OCR images

    Probably the simplest Windows application to OCR images using Tesseract 3.05.02. Languages recognized: German, English, French, Italian, Polish, Spanish. Just download the ZIP file, unzip the archive, and use it anywhere. The solution is published under the MIT license. A description can be found at: https://coolautomations.com/ocr-as-simple-as-it-can-be/
    Downloads: 0 This Week
  • 19
    BL3-MayhemMod

    Automatically Re-roll Mayhem Modifiers in Borderlands 3

    An AutoIt script for automatically re-rolling the Mayhem 10 modifiers in Borderlands 3. Uses Tesseract-OCR for text recognition.
    Downloads: 0 This Week
  • 20
    SwiftOCR

    Fast and simple OCR library written in Swift

    ...As of now, SwiftOCR is optimized for recognizing short, one-line alphanumeric codes (e.g. DI4C9CM). We currently support iOS and OS X. If you want to recognize normal text like a poem or a news article, go with Tesseract, but if you want to recognize short, alphanumeric codes (e.g. gift cards), I would advise you to choose SwiftOCR because that's where it excels. Tesseract is written in C++ and over 30 years old. To use it you first have to write an Objective-C++ wrapper for it. The main issue that's slowing down Tesseract is the way memory is managed. ...
    Downloads: 0 This Week
  • 21

    neocr

    Provides OCR solutions for Nepali, based on Tesseract 4.0.

    NeOCR is free software based on Tesseract (an open source OCR engine) for the Windows operating system. It provides an easy, user-friendly interface to recognize text contained in images and PDF documents and convert it to editable text formats (.txt, .doc, .docx). This product is accessible to blind and visually impaired people (tested with NVDA and Narrator).
    Downloads: 14 This Week
  • 22

    OCR Template Creator

    Create template for images or PDF files to be OCR'ed and databased

    An OCR application that lets you create tag/value templates through a web GUI for automatically processing PDFs or images of documents, receipts, contracts, etc. Provides an interface to Tesseract.
    Downloads: 2 This Week
  • 23
    ...It is able to recognize the page layout even for multicolumn text. Essentially, pdfsandwich is a wrapper script which calls the following binaries: convert, unpaper, tesseract, gs, and hocr2pdf (if tesseract < 3.03). It is known to run on Unix systems and has been tested on Linux and MacOS X. It supports parallel processing on multiprocessor systems. In contrast to most competing sandwich programs, it performs preprocessing of the scanned images, such as de-skewing or removal of dark edges etc. ...
    Downloads: 374 This Week
  • 24
    A Java JNA wrapper for Tesseract OCR API
    Downloads: 87 This Week
  • 25
    JATI - Just Another Tesseract Interface

    Another interface for Tesseract OCR to convert images to text.

    Tesseract OCR is an open source, highly accurate image-to-text converter. However, it provides only a command-line interface. JATI is just another interface to the Tesseract OCR engine, providing a GUI to convert an image to text. It supports batch conversion and can also convert only a portion of an image into text.
    Downloads: 3 This Week