Open Source OCR Engine
Awesome multilingual OCR toolkits based on PaddlePaddle
Contexts Optical Compression
Open source semantic search and text analytics for large document sets
OCR software, free and offline
Crowdsourcing platform for full text transcription and tagging
A framework to enable multimodal models to operate a computer
Handwritten Text Recognition (HTR) system implemented with TensorFlow
OCRmyPDF adds an OCR text layer to scanned PDF files
Enhances Tesseract OCR output using LLMs (local or API)
Accurate × Fast × Comprehensive
A cross-platform software for text translation and recognition
Visual Causal Flow
A simple tool for reading in poorly redacted documents
OCR expert VLM powered by Hunyuan's native multimodal architecture
The media player for language learning, with dual subtitles
An on-premises, OCR-free unstructured data extraction
A ranked list of awesome machine learning Python libraries
A pure Javascript Multilingual OCR
Assist in organizing your piles of documents
JavaScript OCR and text extraction for images and PDFs
Open source AI VTuber platform with voice chat and Live2D avatars
Towards Studio-Grade Character Animation via In-Context Learning of 3D
Framework for building AI-powered interactive digital humans and agent
A Python application to add watermarks (text or image) to PDF files