Open Source OCR Engine
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Awesome multilingual OCR toolkits based on PaddlePaddle
A framework to enable multimodal models to operate a computer
Contexts Optical Compression
OCR software, free and offline
A pure Javascript Multilingual OCR
AI Agent Application Development Framework
Open source semantic search and text analytics for large document sets
Crowdsourcing platform for full text transcription and tagging
Enhances Tesseract OCR output using LLMs (local or API)
A cross-platform software for text translation and recognition
Accurate × Fast × Comprehensive
OCRmyPDF adds an OCR text layer to scanned PDF files
Powerful Android AI agent with tools, automation, and Linux shell
High-performance neural network inference framework for mobile
Open source AI VTuber platform with voice chat and Live2D avatars
OCR expert VLM powered by Hunyuan's native multimodal architecture
Visual Causal Flow
A simple tool for reading in poorly redacted documents
The media player for language learning, with dual subtitles
A Commander for modern Go CLI interactions
AI assistant based on large models that can actively think and plan
Towards Studio-Grade Character Animation via In-Context Learning of 3D
An on-premises, OCR-free unstructured data extraction