Lucon News

Lucene based concordancer

Status: Beta

Brought to you by: catalinmititelu


Lucon release

Lucon (Lucene-based concordancer) is a user-friendly graphical concordancer.
It can be used by researchers, teachers, and students working with texts in any language.
Lucon runs on UTF-8 plain text and XML files.
It can process a single file, several files, or an entire folder.
It builds a list of the words in the searched files together with their frequency of occurrence.
It extracts only the relevant words from XML files, ignoring element names and the names and values of attributes.
It displays the word list with the associated frequencies, a short left and right context for the searched word (the user can set the length of this context), and a larger context.
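
As an illustration of the word list and the short left and right contexts, here is a minimal sketch in plain Java. It is not Lucon's code and does not use Lucene. The class and method names are hypothetical, and it assumes the context length is counted in words; the announcement does not say whether Lucon counts words or characters.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Illustrative sketch only: plain Java collections, not Lucon's Lucene-backed code.
class ConcordanceSketch {

    // Split text into lowercase word tokens (runs of letters or digits).
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<String>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!t.isEmpty()) tokens.add(t);
        }
        return tokens;
    }

    // Word list with frequency of occurrence, sorted alphabetically.
    static Map<String, Integer> frequencies(List<String> tokens) {
        Map<String, Integer> freq = new TreeMap<String, Integer>();
        for (String t : tokens) {
            Integer n = freq.get(t);
            freq.put(t, n == null ? 1 : n + 1);
        }
        return freq;
    }

    // One line per occurrence of `word`, with up to `width` words of context
    // on each side (assumption: context length is measured in words).
    static List<String> kwic(List<String> tokens, String word, int width) {
        List<String> lines = new ArrayList<String>();
        for (int i = 0; i < tokens.size(); i++) {
            if (!tokens.get(i).equals(word)) continue;
            int from = Math.max(0, i - width);
            int to = Math.min(tokens.size(), i + width + 1);
            String left = String.join(" ", tokens.subList(from, i));
            String right = String.join(" ", tokens.subList(i + 1, to));
            lines.add(left + " [" + word + "] " + right);
        }
        return lines;
    }

    public static void main(String[] args) {
        List<String> tokens = tokenize("The cat sat on the mat. The dog sat too.");
        System.out.println(frequencies(tokens));
        for (String line : kwic(tokens, "sat", 2)) {
            System.out.println(line);
        }
    }
}
```

A real concordancer would keep the original casing and punctuation in the displayed context; the sketch lowercases everything to keep the counting simple.
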
It supports fast search: as the user types, the word list is filtered so that only the matching words are displayed.
It can also perform approximate search: given a word, it displays all words with a similar form.
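
The announcement does not say how Lucon decides that two words have a similar form. As one possible reading, the sketch below assumes Levenshtein edit distance with a fixed threshold. The names ApproximateSearchSketch, editDistance and closeForms are hypothetical, and the code is plain Java rather than Lucene.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Illustrative sketch only: edit distance is an assumed similarity measure.
class ApproximateSearchSketch {

    // Levenshtein distance: the minimum number of single-character insertions,
    // deletions or substitutions needed to turn a into b.
    static int editDistance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) prev[j] = j;
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1),
                                   prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    // Keep the words whose distance to the query is at most maxDistance.
    static List<String> closeForms(List<String> words, String query, int maxDistance) {
        List<String> hits = new ArrayList<String>();
        for (String w : words) {
            if (editDistance(w, query) <= maxDistance) hits.add(w);
        }
        return hits;
    }

    public static void main(String[] args) {
        List<String> words = Arrays.asList("color", "colour", "colder", "cloud", "collar");
        System.out.println(closeForms(words, "color", 1));
    }
}
```

Comparing the query against every word is fine for a small word list; an index-backed search engine avoids that full scan on large collections.
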
It can search for multi-word (compound) terms whose parts occur within a certain distance of each other.
It can search through attribute values (XML files only), which is very useful when you want to search for a lemma (assuming you have an XML file in which every word is tagged and has a lemma attribute).
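
To make the XML handling concrete, here is a minimal sketch using the DOM parser from the Java standard library. It is not Lucon's implementation. The class name, the method names and the sample markup (a `w` element per word) are hypothetical; only the lemma attribute comes from the description above.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Illustrative sketch only: DOM parsing, not Lucon's Lucene-backed code.
class XmlSearchSketch {

    // Parse an XML string into a DOM document.
    static Document parse(String xml) {
        try {
            DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
            return builder.parse(new InputSource(new StringReader(xml)));
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse XML", e);
        }
    }

    // Text content only: element names and attributes never appear in the result.
    static String plainText(Document doc) {
        return doc.getDocumentElement().getTextContent();
    }

    // Word forms of every element whose attribute `attr` has the given value.
    static List<String> byAttribute(Document doc, String attr, String value) {
        List<String> forms = new ArrayList<String>();
        NodeList all = doc.getElementsByTagName("*"); // "*" matches every element
        for (int i = 0; i < all.getLength(); i++) {
            Element e = (Element) all.item(i);
            if (value.equals(e.getAttribute(attr))) {
                forms.add(e.getTextContent().trim());
            }
        }
        return forms;
    }

    public static void main(String[] args) {
        Document doc = parse("<s><w lemma=\"be\">was</w> <w lemma=\"go\">going</w> "
                + "<w lemma=\"be\">is</w></s>");
        System.out.println(plainText(doc));
        System.out.println(byAttribute(doc, "lemma", "be"));
    }
}
```

Searching for the lemma "be" returns the inflected forms tagged with it, which is the use case described for attribute search.
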
The word list and the short contexts can be saved to a separate file for further analysis.
Lucon is written in Java 1.5, so it is portable across platforms: Windows, Linux, Mac.
It uses Lucene as its main search engine, so search operations should be faster than with traditional approaches and the application can handle very large amounts of data.

Posted by 2007-10-21