Include Lucene index builder plug so that a searchable
index can be built from the pages being crawled. The
index than can be used to search crawled site...
Accepted for upcoming 1.0 release. Configurability remains an issue, mainly of Analyzers. I presently have it that you can name your own class if it just needs no-args construction, default StandardAnalyzer, and if you need to use a more complex one you override a getAnalyzer() method. Not sure what else can be done. It's a long time since I used Lucene and they keep changing the API, which doesn't help. Also it introduces a dependency on SLF4J grrr.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Accepted for upcoming 1.0 release. Configurability remains an issue, mainly of Analyzers. I presently have it that you can name your own class if it just needs no-args construction, default StandardAnalyzer, and if you need to use a more complex one you override a getAnalyzer() method. Not sure what else can be done. It's a long time since I used Lucene and they keep changing the API, which doesn't help. Also it introduces a dependency on SLF4J grrr.