Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

Natural Language Toolkit (NLTK)

Compare

  Analyzed about 5 hours ago

NLTK — the Natural Language Toolkit — is a suite of open source Python modules, linguistic data and documentation for research and development in natural language processing, supporting dozens of NLP tasks, with distributions for Windows, Mac OSX and Linux.

213K lines of code

54 current contributors

7 days since last commit

45 users on Open Hub

High Activity
5.0
 
I Use This

Apertium

Compare

  Analyzed 27 days ago

Apertium is an open-source machine translation platform, aimed at related-language pairs but expanded to deal with more divergent language pairs. The platform provides 1. a language-independent machine translation engine 2. tools to manage the linguistic data necessary to build a machine ... [More] translation system for a given language pair and 3. linguistic data for a growing number of language pairs. Apertium uses a shallow-transfer machine translation engine which processes the input text in stages, as in an assembly line: de-formatting, morphological analysis, part-of-speech disambiguation, shallow structural transfer, lexical transfer, morphological generation, and re-formatting. [Less]

44.1M lines of code

59 current contributors

28 days since last commit

12 users on Open Hub

Very High Activity
4.9
   
I Use This
Licenses: GFDL-1.2, GPL-2.0+, GPL-3.0+

Apache OpenNLP

Compare

Claimed by Apache Software Foundation Analyzed 29 days ago

Apache OpenNLP is a Java machine learning toolkit for natural language processing (NLP).

129K lines of code

19 current contributors

3 months since last commit

12 users on Open Hub

Moderate Activity
5.0
 
I Use This

Treex - NLP Framework

Compare

  Analyzed about 1 year ago

Treex (formerly TectoMT) is a highly modular NLP software system implemented in Perl programming language under Linux. It is primarily aimed at Machine Translation, making use of the ideas and technology created during the Prague Dependency Treebank project. At the same time, it is also hoped to ... [More] significantly facilitate and accelerate development of software solutions of many other NLP tasks, especially due to re-usability of the numerous integrated processing modules (called blocks), which are equipped with uniform object-oriented interfaces. [Less]

363K lines of code

22 current contributors

about 1 year since last commit

4 users on Open Hub

Activity Not Available
5.0
 
I Use This

matxin

Compare

  Analyzed 3 months ago

Machine translation engine based on a dependency grammar and XML interchange format. The Spanish-Basque (es-eu) translation direction is currently supported.

3.4M lines of code

4 current contributors

7 months since last commit

3 users on Open Hub

Activity Not Available
5.0
 
I Use This

OpenThesaurus

Compare

  Analyzed about 5 hours ago

A web-based application to build thesauri. Can export its data to text and OpenOffice/LibreOffice format.

10.2K lines of code

1 current contributors

8 days since last commit

2 users on Open Hub

Very Low Activity
5.0
 
I Use This

RelEx Semantic Relationship Extractor

Compare

  Analyzed 2 days ago

RelEx is an English-language semantic relationship extractor, built on the Carnegie-Mellon Link Grammar parser. It can identify dependency-grammar dependencies, such as subject, object, indirect object and many other relationships between words in a sentence. It can also provide part-of-speech ... [More] tagging, noun-number tagging, verb tense tagging, gender tagging, and so on. Relex includes a basic implementation of the Hobbs anaphora (pronoun) resolution algorithm. RelEx also provides semantic relationship framing, similar to that of FrameNet. [Less]

12.1K lines of code

4 current contributors

about 2 months since last commit

2 users on Open Hub

Moderate Activity
0.0
 
I Use This

SimMetrics

Compare

  Analyzed about 12 hours ago

SimMetrics is a Similarity Metric Library, e.g. from edit distance's (Levenshtein, Gotoh, Jaro etc) to other metrics, (e.g Soundex, Chapman). Work provided by UK Sheffield University funded by (AKT) an IRC sponsored by EPSRC, grant number GR/N15764/01.

5.76K lines of code

0 current contributors

over 10 years since last commit

2 users on Open Hub

Inactive
5.0
 
I Use This
Licenses: No declared licenses

Affisix

Compare

  Analyzed 14 days ago

Affisix is a program for automatic recognition of affixes. It takes large amount of words and according to the user setting it tries to determine which segments of these words are prefixes.

6.13K lines of code

0 current contributors

over 5 years since last commit

1 users on Open Hub

Inactive
4.0
   
I Use This

Ruby LinkParser

Compare

  Analyzed 10 months ago

A high-level interface to the CMU Link Grammar. This binding wraps the link-grammar shared library provided by the AbiWord project for their grammar-checker.

2.38K lines of code

0 current contributors

over 2 years since last commit

1 users on Open Hub

Activity Not Available
0.0
 
I Use This