WebCollector is an open-source web crawler framework based on Java. It provides simple interfaces for crawling the Web, so you can set up a multi-threaded web crawler in less than 5 minutes.
GitHub:
https://github.com/CrawlScript/WebCollector
Demo:
https://github.com/CrawlScript/WebCollector/blob/master/YahooCrawler.java
License
GNU General Public License version 2.0 (GPLv2)