Showing 59 open source projects for "divide scanned images"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Powering the best of the internet | Fastly Icon
    Powering the best of the internet | Fastly

    Fastly's edge cloud platform delivers faster, safer, and more scalable sites and apps to customers.

    Ensure your websites, applications and services can effortlessly handle the demands of your users with Fastly. Fastly’s portfolio is designed to be highly performant, personalized and secure while seamlessly scaling to support your growth.
    Try for free
  • 1
    OCRmyPDF

    OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files

    OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.
    Downloads: 104 This Week
    Last Update:
    See Project
  • 2
    pikepdf

    pikepdf

    A Python library for reading and writing PDF, powered by QPDF

    pikepdf is a Python library allowing the creation, manipulation, and repair of PDFs. It provides a Pythonic wrapper around the C++ PDF content transformation library, QPDF. Python + QPDF = “py” + “qpdf” = “pyqpdf”, which looks like a dyslexia test and is no fun to type. But say “pyqpdf” out loud, and it sounds like “pikepdf”. pikepdf is a library intended for developers who want to create, manipulate, parse, repair, and abuse the PDF format. It supports reading and write PDFs, including...
    Downloads: 86 This Week
    Last Update:
    See Project
  • 3
    Paperless-ngx

    Paperless-ngx

    A community-supported supercharged version of paperless

    Paperless-ngx is a community-supported open-source document management system that transforms your physical documents into a searchable online archive so you can keep, well, less paper.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 4
    Pix2Text

    Pix2Text

    Open-Source Python3 tool for recognizing layouts, tables, and math

    ... formulas, and integrate all of these contents into Markdown format. P2T can also convert an entire PDF file (which can contain scanned images or any other format) into Markdown format.
    Downloads: 9 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    deepdoctection

    deepdoctection

    A Repo For Document AI

    DeepDoctection is a document AI framework that applies deep learning techniques to analyze and extract structured data from scanned documents, PDFs, and images. deepdoctection is a Python library that orchestrates document extraction and document layout analysis tasks using deep learning models. It does not implement models but enables you to build pipelines using highly acknowledged libraries for object detection, OCR and selected NLP tasks and provides an integrated frameworks for fine-tuning...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6

    Antivirus Live CD

    4MLinux fork including ClamAV scanner

    Antivirus Live CD is an official 4MLinux fork including the ClamAV scanner. It's designed for users who need a lightweight live CD, which will help them to protect their computers against viruses. Ethernet, WiFi, PPP and PPPoE are supported by Antivirus Live CD to enable automatic updates of its virus signature databases. All partitions are mounted during the boot process so that they can be scanned by ClamAV. The Antivirus Live CD ISO images are fully compatible with UNetbootin, which can...
    Leader badge
    Downloads: 160 This Week
    Last Update:
    See Project
  • 7

    Image To Text tools

    ITTT is a Free tool designed to Scan and extract Text from Images.

    Image To Text Tools is a 100% Free user-friendly tool designed to Scan and extract containing text in images into editable text formats. Whether you need to extract text from scanned documents, photographs, or other image files, Image To Text Tools provides accurate and reliable Optical Character Recognition (OCR) capabilities to meet your needs.
    Downloads: 42 This Week
    Last Update:
    See Project
  • 8
    LPub3D

    LPub3D

    LDraw™ editor for LEGO® style digital building instructions.

    LPub3D is an Open Source WYSIWYG editing application for creating LEGO® style digital building instructions. LPub3D is developed and maintained by Trevor SANDY. It uses the LDraw™ parts library, the most comprehensive library of digital Open Source LEGO® bricks available (www.ldraw.org/ ) and reads the LDraw LDR and MPD model file formats. LPub3D is available for free under the GNU Public License v3 and runs on Windows, Linux and macOS Operating Systems. LPub3D is also...
    Leader badge
    Downloads: 23 This Week
    Last Update:
    See Project
  • 9
    Super PDF Editor (a Batch PDF Processor)

    Super PDF Editor (a Batch PDF Processor)

    Create, Edit, Delete, Organize , Convert, Export, Secure & Sign PDF.

    Super PDF Editor is a robust and versatile PDF management software designed to streamline your document handling needs. Whether you're an individual, student, or professional, this software offers a comprehensive suite of tools to create, edit, and manage your PDFs with ease. Key Features: Extract Page: Easily extract specific pages from a PDF document. Split Page: Divide a single PDF page into multiple smaller pages. Rotate Page: Rotate pages to adjust their orientation. Merge Page...
    Leader badge
    Downloads: 14 This Week
    Last Update:
    See Project
  • Picsart Enterprise Background Removal API for Stunning eCommerce Visuals Icon
    Picsart Enterprise Background Removal API for Stunning eCommerce Visuals

    Instantly remove the background from your images in just one click.

    With our Remove Background API tool, you can access the transformative capabilities of automation , which will allow you to turn any photo asset into compelling product imagery. With elevated visuals quality on your digital platforms, you can captivate your audience, and therefore achieve higher engagement and sales.
    Learn More
  • 10

    planoMetrix

    Corrects scanner distortions using a reference scan for realignment.

    PlanoMetrix improves scan accuracy by correcting distortions caused by sensor motion mechanics and paper transport irregularities in flatbed and document feeder scanners. Many scanners introduce periodic errors and inconsistencies in horizontal scaling, leading to distortions in scanned documents. The software works in two phases: Reference Scan: A graph paper (e.g., millimeter paper) is scanned to determine alignment parameters. Correction Phase: Subsequent scans are adjusted based...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    MDCrop

    MDCrop

    Automatic batch cropping of images. Autorotation optional.

    MDCrop is an application that can be used for automatic cropping of images. Its typical use is cropping scanned documents in the process of digitization. The application also allows automatic rotation of the processed scan to an optimized position for subsequent cropping.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12

    image2relief

    Create CNC G-code from images for gemstone engraving.

    image2relief is a C++ program using OpenCV to convert raster images into G-code for CNC engraving of gemstones. It supports the creation of mid-reliefs, low reliefs, and shallow reliefs (without undercuts), using image-based surface profiling. The tool accounts for different end mill diameters by simulating circular tool footprints. The image is scanned stepwise along a customizable path. At each step, the maximum pixel value inside the end mill's footprint is used to determine engraving depth...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    scanExtract

    scanExtract: CLI tool to split and rotate scanned photos precisely.

    scanExtract is a command-line utility designed for quick and precise digitization of photo prints. Simply place multiple photos on a flatbed scanner, scan them, and let scanExtract automatically detect, split, and individually rotate each image to achieve perfectly aligned borders. Using high-precision algorithms, the tool handles each photo separately, eliminating the need for careful placement on the scanner. It saves images with an overflow border to ensure no details are lost. After...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    A Qt GUI for archiving old floppies, CD-ROMs and other types of media to organized collections of disk image files, along with scanned images of the disks' physical appearance so that you can find them later by their familiar looks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Super-PDF-Editor-Lite

    Super-PDF-Editor-Lite

    World's most comprehensive, powerful, process-based PDF editor

    .... Easy pdf imposition, booklet, n ups pages, and more. OCR performs in pdf files, scanned pdf files and any pdf files. OCR performs in image files, and supports multiple image formats. Auto and manual image enhancement for better OCR accuracy and quality. Supports 165+ languages with three languages data set. Use Multiple Languages at once. International Languages: 127 Languages, High, Medium, and Fast Quality. Scanned Images (jpg, png, gif, tiff, bmp) Multi-Page and TIFF and GIF, Scanned PDFs.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 16
    Super-PDF-Editor

    Super-PDF-Editor

    World's most comprehensive, powerful, process-based PDF editor

    ... performs in pdf files, scanned pdf files and any pdf files. OCR performs in image files, and supports multiple image formats. Auto and manual image enhancement for better OCR accuracy and quality. Supports 165+ languages with three languages data set. Use Multiple Languages at once. International Languages: 127 Languages, High, Medium, and Fast Quality. Scanned Images (jpg, png, gif, tiff, bmp) Multi-Page and TIFF and GIF, Scanned PDFs.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 17
    AiHound

    AiHound

    AI powered image classification for nudity and documents / id-cards

    AI Hound is designed to run from an USB pendrive or any other kind of removeable and writeable media. The programm checks all Office-documents, Images and videos for various categories for images. Actually It can recognice nudity/porn and scanned or photographed documents / ID- and credit-cards. I am working on a model that also recognice various types of drugs in images.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Karto

    Karto

    Tool to work on map data, to check the data and generate your own maps

    Karto allows to calibrate a scanned map (or aerial photos) with some known geo-points (Lambert, UTM, ...). After that, you can add any points (GPS points) in the map to build your own map. You can also add a transparent GIF image in the map after calibration (cave, mines, roads, ...). You can realize your own cartes with the data you add inside adding/showing/hiding layers of data. You can import/export your data files (after checking and correcting them) in differents formats.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Parsr

    Parsr

    Transforms PDF, Documents and Images into Enriched Structured Data

    Parsr is an open-source document parsing tool that converts PDFs, scanned images, and other structured documents into structured, machine-readable data formats.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    Linux-Intelligent-Ocr-Solution

    Linux-Intelligent-Ocr-Solution

    Easy-OCR solution and Tesseract trainer for GNU/Linux

    Linux-intelligent-ocr-solution Lios is a free and open source software for converting print in to text using either scanner or a camera, It can also produce text out of scanned images from other sources such as Pdf, Image, Folder containing Images or screenshot. Program is given total accessibility for visually impaired. A Tesseract Trainer GUI is also shipped with this package. Forum : https://groups.google.com/forum/#!forum/lios Video Tutorial : https://www.youtube.com/playlist?list...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 21
    nonechucks

    nonechucks

    Deal with bad samples in your dataset dynamically

    nonechucks is a library that provides wrappers for PyTorch's datasets, samplers and transforms to allow for dropping unwanted or invalid samples dynamically. What if you have a dataset of 1000s of images, out of which a few dozen images are unreadable because the image files are corrupted? Or what if your dataset is a folder full of scanned PDFs that you have to OCRize, and then run a language detector on the resulting text, because you want only the ones that are in English? Or maybe you have...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    Merge PDF Files

    It is a Windows library that merges standard PDFs into a final PDF

    ... or asynchronous. We want to give you a benchmark, the library was used to create a PDF from single page(scanned) image by an OCR SDK (it is not included in our library, you can use any on the market): 20,000 Images (the OCR SDK creates single page PDF text searchable, running 50 threads) in 80 minutes. The size of the final PDF searchable was 800Mb. If you download the library, we provide a sample which cover all the scenarios possible (synchronous and asynchronous).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Random File Picker

    Random File Picker

    Choose (and open) random files from random directories.

    Random File Picker can quickly choose and open random files from a set of directories. Files can be filtered so you can for example look only for images. The directories that you supply to RFP will be scanned for sub-directories (first a directory is chosen, then a file from it). In short it is a tool that makes decisions in your place. Not sure what wallpaper to choose for today? Let it decide for you. Not sure what movie to watch? It can open a random one in just a few clicks.
    Leader badge
    Downloads: 48 This Week
    Last Update:
    See Project
  • 24
    pdfsandwich generates "sandwich" OCR pdf files, i.e. pdf files which contain only images (but no editable text) will be processed by optical character recognition (OCR) and the text will be added to each page invisibly "behind" the images. pdfsandwich is a command line tool which is supposed to be useful to OCR scanned books or journals. It is able to recognize the page layout even for multicolumn text. Essentially, pdfsandwich is a wrapper script which calls the following binaries...
    Leader badge
    Downloads: 310 This Week
    Last Update:
    See Project
  • 25
    Scrollout F1

    Scrollout F1

    An easy-to-use anti-spam email gateway

    • Designed for Linux and Windows email system administrators, Scrollout F1 is an easy to use, already adjusted email firewall (gateway) offering free anti-spam and anti-virus protection aiming to secure existing email servers, old or new, such as Microsoft Exchange, Lotus Domino, Postfix, Exim, Sendmail, Qmail and others. • Built-in multilayer security levels make configuration effort equal to a car radio. • It combines simplicity with effective protection using powerful open source...
    Leader badge
    Downloads: 119,040 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.