Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

sunpinyin

Compare

  Analyzed 11 days ago

SunPinyin is an opensource'd (in CDDL/LGPLv2.1) and SLM (Statistical Language Model) based Chinese PinYin input method engine. Currently, it's available on all UNIX platforms including MacOSX.

43.7K lines of code

3 current contributors

3 months since last commit

95 users on Open Hub

Very Low Activity
4.76923
   
I Use This
Licenses: common_de..., lgpl

Natural Language Toolkit (NLTK)

Compare

  Analyzed 17 days ago

NLTK — the Natural Language Toolkit — is a suite of open source Python modules, linguistic data and documentation for research and development in natural language processing, supporting dozens of NLP tasks, with distributions for Windows, Mac OSX and Linux.

228K lines of code

42 current contributors

27 days since last commit

45 users on Open Hub

Moderate Activity
5.0
 
I Use This

GATE

Compare

  Analyzed about 2 months ago

0 lines of code

0 current contributors

0 since last commit

14 users on Open Hub

Activity Not Available
5.0
 
I Use This
Mostly written in language not available
Licenses: No declared licenses

Apertium

Compare

  Analyzed about 2 months ago

Apertium is an open-source machine translation platform, aimed at related-language pairs but expanded to deal with more divergent language pairs. The platform provides 1. a language-independent machine translation engine 2. tools to manage the linguistic data necessary to build a machine ... [More] translation system for a given language pair and 3. linguistic data for a growing number of language pairs. Apertium uses a shallow-transfer machine translation engine which processes the input text in stages, as in an assembly line: de-formatting, morphological analysis, part-of-speech disambiguation, shallow structural transfer, lexical transfer, morphological generation, and re-formatting. [Less]

33.6M lines of code

0 current contributors

almost 2 years since last commit

13 users on Open Hub

Activity Not Available
4.9
   
I Use This
Licenses: GNU_Free_..., gpl, gpl3_or_l...

TreeTagger for Java

Compare

  Analyzed 10 days ago

TreeTagger for Java is a Java wrapper around the popular TreeTagger package by Helmut Schmid. It was written with a focus on platform-independence and easy integration into applications. It is written in Java 5 and has been tested on OS X, Ubuntu Linux, and Windows.

2.67K lines of code

0 current contributors

about 4 years since last commit

12 users on Open Hub

Inactive
5.0
 
I Use This

Apache OpenNLP

Compare

Claimed by Apache Software Foundation Analyzed 17 days ago

Apache OpenNLP is a Java machine learning toolkit for natural language processing (NLP).

142K lines of code

8 current contributors

22 days since last commit

12 users on Open Hub

Very Low Activity
5.0
 
I Use This

LanguageTool

Compare

  Analyzed 13 days ago

LanguageTool is an Open Source language checker for English, German, Polish, Dutch, and other languages. It's rule based, i.e. it will find errors for which a rule is defined in an XML configuration files. Rules for more complicated errors can be written in Java.

645K lines of code

37 current contributors

17 days since last commit

11 users on Open Hub

Very High Activity
4.66667
   
I Use This

Stanford Parser

Compare

  Analyzed about 2 months ago

This is a Java natural language parser. From the home page: A natural language parser is a program that works out the grammatical structure of sentences, for instance, which groups of words go together (as "phrases") and which words are the subject or object of a verb. Probabilistic parsers ... [More] use knowledge of language gained from hand-parsed sentences to try to produce the most likely analysis of new sentences. These statistical parsers still make some mistakes, but commonly work rather well. Their development was one of the biggest breakthroughs in natural language processing in the 1990s. You can try out our parser online. [Less]

603K lines of code

18 current contributors

5 months since last commit

7 users on Open Hub

Activity Not Available
4.0
   
I Use This

DKPro Core

Compare

  Analyzed 17 days ago

DKPro Core is a collection of software components for natural language processing (NLP) based on the Apache UIMA framework. Many powerful and state-of-the-art NLP components are already freely available in the NLP research community. New and improved components are being developed and released ... [More] continuously. The components cover the whole range of NLP-related processing tasks. DKPro Core provides wrappers for such third-party tool as well as original NLP components. DKPro Core builds heavily on uimaFIT which allows for rapid and easy development of NLP processing pipelines. [Less]

170K lines of code

8 current contributors

about 2 months since last commit

6 users on Open Hub

Moderate Activity
4.75
   
I Use This

Stanford Named Entity Recognizer

Compare

  Analyzed about 2 months ago

Stanford NER (a.k.a., CRFClassifier) is a Java implementation of a Named Entity Recognizer. Named Entity Recognition (NER) labels sequences of words in a text which are the names of things, such as person and company names, or gene and protein names. The software provides a general (arbitrary order) ... [More] implementation of linear chain Conditional Random Field (CRF) sequence models, of the sort pioneered by Lafferty, McCallum, and Pereira (2001), coupled with well-engineered feature extractors for Named Entity Recognition. [Less]

603K lines of code

17 current contributors

5 months since last commit

6 users on Open Hub

Activity Not Available
4.0
   
I Use This