PHPCrawl is a high configurable webcrawler/webspider-library written in PHP. It supports filters, limiters, cookie-handling, robots.txt-handling, multiprocessing and much more.
License
GNU General Public License version 2.0 (GPLv2)Follow PHPCrawl
Other Useful Business Software
Fully Managed MySQL, PostgreSQL, and SQL Server
Cloud SQL handles your database ops end to end, so you can focus on your app.
Rate This Project
Login To Rate This Project
User Reviews
-
***A*W*E*S*O*M*E***
-
Wow, this crawler has it all. It is - even with a point zero release faster and more mature and feature rich than every other I tried. Especially useful is the test interface, where you can try out all the parameters, without coding. This is excellent work!
-
Great tool to crawl sites, excelent support
-
Could use a "Per domain page-limit" :)
-
Very good job. Hard to find better!