NlpTools is a library for natural language processing written in php. Its development is driven by the author's needs for text classification, clustering, tokenizing, stemming etc.
Spelt is a simple graphical program that can be used to classify words in a language. It is particularly designed to identify word roots and to classify them according to part-of-speech. The initial development of this program was specifically meant to simplify work on spell checkers, but you might find it useful for many other purposes.
CorpusCatcher is a corpus collection toolset. It can help you to build language or topic specific corpora from publicly available web resources. This can be very useful for many purposes, especially for data to build spell checkers.
Parallel text aligner dessigned to generate transation memories (TMX files) from two files tagged with any kind of XML-based tags. The application uses the tag structure and the text blok length to perform the alignment.
This site uses cookies to give you the best possible experience.
By using the site, you consent to our use of cookies.
For more information, please see our
Privacy Policy