Tags : Browse Projects

Natural Language Toolkit (NLTK)

  Analyzed about 8 hours ago

NLTK, the Natural Language Toolkit, is a suite of open source Python modules, linguistic data, and documentation for research and development in natural language processing. It supports dozens of NLP tasks and provides distributions for Windows, Mac OS X, and Linux.

214K lines of code

61 current contributors

about 23 hours since last commit

45 users on Open Hub

Moderate Activity
5.0

Apache UIMA

Claimed by Apache Software Foundation; analyzed almost 2 years ago

Apache UIMA is an Apache-licensed open source implementation of the UIMA specification (that specification is, in turn, being developed concurrently by a technical committee within OASIS, a standards organization). We invite and encourage you to participate in both the implementation and specification efforts. UIMA is a component framework for analysing unstructured content such as text, audio and video. It comprises an SDK and tooling for composing and running analytic components written in Java and C++, with some support for Perl, Python and TCL.

1.05M lines of code

9 current contributors

about 2 years since last commit

20 users on Open Hub

Activity Not Available
5.0

GATE

  Analyzed 2 months ago

0 lines of code

0 current contributors

0 since last commit

14 users on Open Hub

Activity Not Available
5.0
Mostly written in language not available
Licenses: No declared licenses

Apache OpenNLP

Claimed by Apache Software Foundation; analyzed 9 months ago

Apache OpenNLP is a Java machine learning toolkit for natural language processing (NLP).

129K lines of code

19 current contributors

11 months since last commit

12 users on Open Hub

Activity Not Available
5.0

TreeTagger for Java

  Analyzed about 7 hours ago

TreeTagger for Java is a Java wrapper around the popular TreeTagger package by Helmut Schmid. It was written with a focus on platform-independence and easy integration into applications. It is written in Java 5 and has been tested on OS X, Ubuntu Linux, and Windows.

2.67K lines of code

0 current contributors

about 2 years since last commit

12 users on Open Hub

Inactive
5.0

LanguageTool

  Analyzed 9 days ago

LanguageTool is an open source language checker for English, German, Polish, Dutch, and other languages. It is rule-based, i.e. it finds errors for which a rule is defined in its XML configuration files. Rules for more complicated errors can be written in Java.

518K lines of code

49 current contributors

10 days since last commit

10 users on Open Hub

Very High Activity
4.66667

Stanford Parser

  Analyzed 2 months ago

This is a Java natural language parser. From the home page: A natural language parser is a program that works out the grammatical structure of sentences, for instance, which groups of words go together (as "phrases") and which words are the subject or object of a verb. Probabilistic parsers use knowledge of language gained from hand-parsed sentences to try to produce the most likely analysis of new sentences. These statistical parsers still make some mistakes, but commonly work rather well. Their development was one of the biggest breakthroughs in natural language processing in the 1990s. You can try out our parser online.

574K lines of code

29 current contributors

3 months since last commit

7 users on Open Hub

Activity Not Available
4.0

DKPro Core

  Analyzed over 1 year ago

DKPro Core is a collection of software components for natural language processing (NLP) based on the Apache UIMA framework. Many powerful and state-of-the-art NLP components are already freely available in the NLP research community. New and improved components are being developed and released continuously. The components cover the whole range of NLP-related processing tasks. DKPro Core provides wrappers for such third-party tools as well as original NLP components. DKPro Core builds heavily on uimaFIT, which allows for rapid and easy development of NLP processing pipelines.

138K lines of code

14 current contributors

over 1 year since last commit

6 users on Open Hub

Activity Not Available
4.75
Licenses: Apache-2.0, GPL-3.0

Treex - NLP Framework

  Analyzed about 2 months ago

Treex (formerly TectoMT) is a highly modular NLP software system implemented in the Perl programming language under Linux. It is primarily aimed at machine translation, making use of the ideas and technology created during the Prague Dependency Treebank project. It is also intended to significantly facilitate and accelerate the development of software solutions for many other NLP tasks, especially through the reusability of its numerous integrated processing modules (called blocks), which are equipped with uniform object-oriented interfaces.

224K lines of code

6 current contributors

about 2 months since last commit

4 users on Open Hub

Activity Not Available
5.0

MeCab

  Analyzed 2 months ago

MeCab is a fast and customizable Japanese morphological analyzer. MeCab is designed for general-purpose use and has been applied to a variety of NLP tasks, such as Kana-Kanji conversion. MeCab provides parameter estimation functionality based on CRFs and HMMs.

33 lines of code

0 current contributors

over 11 years since last commit

3 users on Open Hub

Activity Not Available
0.0
Licenses: No declared licenses