Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

Natural Language Toolkit (NLTK)

Compare

  Analyzed 10 days ago

NLTK — the Natural Language Toolkit — is a suite of open source Python modules, linguistic data and documentation for research and development in natural language processing, supporting dozens of NLP tasks, with distributions for Windows, Mac OSX and Linux.

211K lines of code

47 current contributors

28 days since last commit

45 users on Open Hub

High Activity
5.0
 
I Use This

Apache OpenNLP

Compare

Claimed by Apache Software Foundation Analyzed 8 months ago

Apache OpenNLP is a Java machine learning toolkit for natural language processing (NLP).

460K lines of code

6 current contributors

8 months since last commit

12 users on Open Hub

Activity Not Available
5.0
 
I Use This

Apertium

Compare

  Analyzed 23 days ago

Apertium is an open-source machine translation platform, aimed at related-language pairs but expanded to deal with more divergent language pairs. The platform provides 1. a language-independent machine translation engine 2. tools to manage the linguistic data necessary to build a machine ... [More] translation system for a given language pair and 3. linguistic data for a growing number of language pairs. Apertium uses a shallow-transfer machine translation engine which processes the input text in stages, as in an assembly line: de-formatting, morphological analysis, part-of-speech disambiguation, shallow structural transfer, lexical transfer, morphological generation, and re-formatting. [Less]

43.5M lines of code

65 current contributors

5 months since last commit

12 users on Open Hub

Very High Activity
4.9
   
I Use This
Licenses: GFDL-1.2, GPL-2.0+, GPL-3.0+

Treex - NLP Framework

Compare

  Analyzed 8 months ago

Treex (formerly TectoMT) is a highly modular NLP software system implemented in Perl programming language under Linux. It is primarily aimed at Machine Translation, making use of the ideas and technology created during the Prague Dependency Treebank project. At the same time, it is also hoped to ... [More] significantly facilitate and accelerate development of software solutions of many other NLP tasks, especially due to re-usability of the numerous integrated processing modules (called blocks), which are equipped with uniform object-oriented interfaces. [Less]

363K lines of code

22 current contributors

8 months since last commit

4 users on Open Hub

Activity Not Available
5.0
 
I Use This

OpenThesaurus

Compare

  Analyzed 5 months ago

A web-based application to build thesauri. Can export its data to text and OpenOffice/LibreOffice format.

9.93K lines of code

2 current contributors

5 months since last commit

2 users on Open Hub

Activity Not Available
5.0
 
I Use This

matxin

Compare

  Analyzed 5 months ago

Machine translation engine based on a dependency grammar and XML interchange format. The Spanish-Basque (es-eu) translation direction is currently supported.

3.4M lines of code

4 current contributors

9 months since last commit

2 users on Open Hub

Activity Not Available
5.0
 
I Use This

RelEx Semantic Relationship Extractor

Compare

  Analyzed 19 days ago

RelEx is an English-language semantic relationship extractor, built on the Carnegie-Mellon Link Grammar parser. It can identify dependency-grammar dependencies, such as subject, object, indirect object and many other relationships between words in a sentence. It can also provide part-of-speech ... [More] tagging, noun-number tagging, verb tense tagging, gender tagging, and so on. Relex includes a basic implementation of the Hobbs anaphora (pronoun) resolution algorithm. RelEx also provides semantic relationship framing, similar to that of FrameNet. [Less]

15.5K lines of code

7 current contributors

about 1 month since last commit

2 users on Open Hub

Moderate Activity
0.0
 
I Use This

SimMetrics

Compare

  Analyzed 5 months ago

SimMetrics is a Similarity Metric Library, e.g. from edit distance's (Levenshtein, Gotoh, Jaro etc) to other metrics, (e.g Soundex, Chapman). Work provided by UK Sheffield University funded by (AKT) an IRC sponsored by EPSRC, grant number GR/N15764/01.

5.76K lines of code

0 current contributors

about 10 years since last commit

2 users on Open Hub

Activity Not Available
5.0
 
I Use This
Licenses: No declared licenses

Link Grammar

  Analyzed 19 days ago

The Link Grammar Parser is a syntactic parser of English, based on link grammar, an original theory of English syntax. Given a sentence, the system assigns to it a syntactic structure, which consists of a set of labeled links connecting pairs of words. The parser also produces a "constituent" (Penn ... [More] tree-bank style phrase tree) representation of a sentence (showing noun phrases, verb phrases, etc.). [Less]

65.1K lines of code

9 current contributors

about 1 month since last commit

1 users on Open Hub

Very High Activity
0.0
 
I Use This

Giellatekno

Compare

  Analyzed 4 months ago

Giellatekno, Centre for Saami language technology at the University of Tromsø, started as a project for Saami grammatical analysis, later extended into syntax, proofing tools, interactive pedagogical programs, electronic dictionaries, and text-to-speech. The linguistic philosophy is that programs ... [More] for linguistic analysis should be funded on deep linguistic knowledge, where the analysis takes word forms as a starting point, and build the syntactic analysis bottom-up, rather than vice versa. In this way we are able to build analysers that are both robust but at the same time give deep rather than shallow linguistic analyses. These analysers form the basis both for practical programs for end users and for advanced linguistic research. [Less]

76.1M lines of code

41 current contributors

10 months since last commit

1 users on Open Hub

Activity Not Available
5.0
 
I Use This