AI-powered document analysis and tagging for Paperless-ngx
Use LLMs and LLM Vision (OCR) to handle paperless-ngx
The standard data-centric AI package for data quality and ML
Open source NLP guide with models, methods, and real use cases
Unified framework for building enterprise RAG pipelines
Apache OpenNLP
AI-powered tool for efficient abstract and PDF screening
Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex
A very simple framework for state-of-the-art NLP
Scalable data preprocessing and curation toolkit for LLMs
Bringing BERT into modernity via both architecture changes and scaling
Access and use all DeepSeek AI models in one program.
Award-winning modern data processing SDK in C++20
e-Dokyumento is a web-based Document Management System (DMS)
CPU/GPU inference server for Hugging Face transformer models
State-of-the-art explainers for text-based machine learning models
Innovative text document search. See http://dynaq.opendfki.de for details.
Library for fast text classification and representation
Document/Text Classification using a Naive Bayes model.
GPU-based Textual kNN (GT-kNN)
Multimedia Multimodal Information Retrieval (audio, video, image)
A machine learning system for supervised document classification