AI-powered document analysis and tagging for Paperless-ngx
A Repo For Document AI
A high-quality tool for convert PDF to Markdown and JSON
Document (PDF, Word, PPTX ...) extraction and parse API
Get your documents ready for gen AI
Text mining using tidy tools
Open source semantic search and text analytics for large document sets
A Model Context Protocol (MCP) server implementation
Document content and metadata extraction microservice
ExtractThinker is a Document Intelligence library for LLMs
Private chat with local GPT with document, images, video, etc.
PHP low-level client for Elasticsearch
Clean network diagrams, One-time setup, zero upkeep
LongBench v2 and LongBench (ACL 25'&24')
A system for agentic LLM-powered data processing and ETL
Full-stack Open-source Self-Evolving General AI Agent
Multi-tool for semantic search
RAG-Anything: All-in-One RAG Framework
A general-purpose tool for dynamic report generation in R
Autonomous agents for everyone
Research-oriented chatbot framework
Cross-platform SDK for creating and modifying PDF documents
Extract and convert data from any document, images, pdfs, word doc
Open-Source Financial Large Language Models
Unified framework for building enterprise RAG pipelines