Open Source OCR Engine
A pure JavaScript multilingual OCR
OCR software, free and offline
Enhances Tesseract OCR output using LLMs (local or API)
Contexts Optical Compression
Accurate × Fast × Comprehensive
PDF to Markdown with vision models
Visual Causal Flow
OCRmyPDF adds an OCR text layer to scanned PDF files
Formula recognition based on LaTeX-OCR and ONNXRuntime
Fast and efficient unstructured data extraction
Offline OCR command-line program for Windows that recognizes text in images
Awesome multilingual OCR toolkits based on PaddlePaddle
A community-supported supercharged version of paperless
Ready-to-use OCR with 80+ supported languages
Software for screenshots, word marking, OCR, AI, and translation
Free OCR Software: No internet required, easy to use.
A high-quality tool for converting PDF to Markdown and JSON
Library for OCR-related tasks powered by Deep Learning
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Multilingual Document Layout Parsing in a Single Vision-Language Model
A cross-platform software for text translation and recognition
OCR expert VLM powered by Hunyuan's native multimodal architecture
Convert AI papers to GUI