Today, version 1.0.3 of the UCLA MII group's NLP Toolkit was released. This release adds a graphical tool for tagging documents to create training data for the de-identification training tool. The de-identification training tool's GUI has also been reworked and beautified. For more information on these tools, see http://www.mii.ucla.edu/nlp/guide/training/index.html.
The UCLA MII group is pleased to announce the open-source release of its NLP Toolkit. This preliminary release includes the framework for a general-purpose NLP engine, NLP-based de-identification tools for free-text medical reports, and supporting Java classes for working with the NLP engine. In particular, this early release focuses on boundary detection (section and sentence). Future updates to this code base will include lexical analysis, parsing, semantic analysis, and frame building. Examples, source code, and supporting documentation are provided through this web site.