Open Source OCR Engine
Contexts Optical Compression
PDF to Markdown with vision models
Formula recognition based on LaTeX-OCR and ONNXRuntime
Free OCR Software: No internet required, easy to use.
OCRmyPDF adds an OCR text layer to scanned PDF files
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Awesome multilingual OCR toolkits based on PaddlePaddle
A pure Javascript Multilingual OCR
Ready-to-use OCR with 80+ supported languages
Open Source Document Management System for Digital Archives
PDF scientific paper translation with preserved formats
A high-quality tool for convert PDF to Markdown and JSON
Library for OCR-related tasks powered by Deep Learning
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Web application that allows you to perform operations on PDF files
A community-supported supercharged version of paperless
A Repo For Document AI
Free Open Source Enterprise Grade RPA
Math OCR model that outputs LaTeX and markdown
Assist in organizing your piles of documents
WindowTextExtractor allows you to get a text from any OS
A framework to enable multimodal models to operate a computer
Open source clipboard management tools for Windows, Macos and Linux