A Java implementation of a flexible and extensible web spider engine.
Optional modules allow functionality to be added (searching dead links, testing the performance and scalability of a site, creating a sitemap, etc ..
License
GNU Library or Lesser General Public License version 2.0 (LGPLv2)Follow JSpider
Other Useful Business Software
Cut Data Warehouse Costs up to 54% with BigQuery
BigQuery delivers up to 54% lower TCO than cloud alternatives. Migrate from legacy or competing warehouses using free BigQuery Migration Service with automated SQL translation. Get serverless scale with no infrastructure to manage, compressed storage, and flexible pricing—pay per query or commit for deeper discounts. New customers get $300 in free credit.
Rate This Project
Login To Rate This Project
User Reviews
-
Hello, I am a MSc student in Addis Ababa University department of computer science, Ethiopia. I am working my MSc thesis on language specific search engine. So, I am using JSpider as a crawling tool. Since I am new for the tool, what configurations should I do on the tool so that it can be used for crawling pages from the web? Regards HB
-
good job
-
No progress on this, I suppose... Crawler4j(2008) and Nutch(2009) are the on going ones and have stable releases also ...
-
Seems outdated. The linkchecker doesn't work; results in exceptions.