Text AnalysisThe goal of this project is to provide a java implementation, with an easy to use API and full unit-test coverage, of some techniques to perform the following analysis:
Text Language Detection Keywords and keyphrases extraction Text Classification Text Clustering Document Summarization (single or multiple documents) Plagiarism Detection A brief overview about these methods is in this paper.
Practical examples an case of study of this techniques are: TODO
Actually this projects is still under active development, unusable for production scopes.
News30.08.2008 Reached 80% class testing coverage 16.08.2008 Quality Assurance: added Findbugs analysis and testing coverage measure with Emma.
Use Patent Claims
Include Install Instructions
These details are provided for information only. No information here is legal advice and should not be used as such.