dude uncomplicated data extraction: A simple framework
Turn entire websites into LLM-ready markdown or structured data
Crawl a website starting from a URL, find relevant pages
AI-ready web crawler that extracts and structures website content
Clone any website with one command using AI coding agents
ExtractThinker is a Document Intelligence library for LLMs
CLI tool to extract (meta)data from PDF and manipulate PDF files
Lightweight library for scraping web-sites with LLMs
Structured data extraction and instruction calling with ML, LLM
PDF Parser for AI-ready data. Automate PDF accessibility
MD/.JSON Document OCR and structured data extraction API
Unreal Engine Archives Explorer
Clean network diagrams, One-time setup, zero upkeep
Automate browser-based workflows with LLMs and Computer Vision
Open source web scraping system for automated data collection tasks
No-code LLM Platform to launch APIs and ETL Pipelines
To extract main article from given URL with Node.js
Fast and efficient unstructured data extraction
Model Context Protocol server that integrates AgentQL's data
A collection of hacks and one-off scripts
A chrome extension for automating your browser by connecting blocks
Automatic extraction of relevant features from time series
Tools to build web AI agents that can authenticate
Library for extracting streaming site data without official APIs
Did you say you like data?