Open Source OCR Engine
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Awesome multilingual OCR toolkits based on PaddlePaddle
Contexts Optical Compression
OCR software, free and offline
ExDARK dataset is the largest collection of low-light images
A pure JavaScript Multilingual OCR
Open source semantic search and text analytics for large document sets
Open Source Computer Vision Library
Crowdsourcing platform for full text transcription and tagging
Enhances Tesseract OCR output using LLMs (local or API)
Cross-platform software for text translation and recognition
Accurate × Fast × Comprehensive
A framework to enable multimodal models to operate a computer
OCRmyPDF adds an OCR text layer to scanned PDF files
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Code release for Cut and Learn for Unsupervised Object Detection
Open source AI VTuber platform with voice chat and Live2D avatars
A cross-platform video structuring (video analysis) framework
Interactive Machine Learning experiments
OCR expert VLM powered by Hunyuan's native multimodal architecture
Visual Causal Flow
A simple tool for reading in poorly redacted documents
HTTP server cookie parsing and serialization
Unofficial Go (Golang) bindings for the Hugging Face Inference API