Features

  • Document indexing and selection using Apache's Lucene
  • Fast VSM generation with several local and global weights (term - doc matrix)
  • Dimensionality reduction using SVD or NMF for LSA or related.
  • Meta-data annotators (PennTree grammar parsing).
  • Operations: Document distances, topic clustering, keyword extraction, and many more!

Project Activity

See All Activity >

License

Apache License V2.0

Follow TML - Text Mining Library for LSA & CMM

TML - Text Mining Library for LSA & CMM Web Site

Other Useful Business Software
AI-powered service management for IT and enterprise teams Icon
AI-powered service management for IT and enterprise teams

Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
Try it Free
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
3
0
0
0
0
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5

User Reviews

  • It seems to be good, but there are some errors that dont let the program load correctly the library ( Abstract Annotator constructor receives parameters but PennTreeAnnotator doesnt receive)
  • very good library for doing text mining
  • great
Read more reviews >

Additional Project Details

Intended Audience

Science/Research, Developers

User Interface

Command-line

Programming Language

Java

Database Environment

MySQL

Related Categories

Java Artificial Intelligence Software, Java Linguistics Software, Java Research Software

Registered

2009-11-11