Crawl-By-Example runs a crawl, which classifies the processed pages by subjects and finds the best pages according to examples provided by the operator. Crawl-By-Example is a plugin to the Heritrix crawler, and was done as a part of GSoC06 program.
License
GNU Library or Lesser General Public License version 2.0 (LGPLv2)Follow Crawl-By-Example (Heritrix plugin)
Other Useful Business Software
Auth for GenAI | Auth0
Easily implement secure login experiences for AI Agents - from interactive chatbots to background workers with Auth0. Auth for GenAI is now available in Developer Preview
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of Crawl-By-Example (Heritrix plugin)!