Tags : Browse Projects

NLPBamboo

  Analyzed about 1 year ago

Bamboo is a Chinese natural language processing system. It currently includes Chinese word tokenization, part-of-speech tagging, and named entity recognition.

49.8K lines of code

0 current contributors

over 6 years since last commit

5 users on Open Hub

Activity Not Available
5.0
 

Ruby Linguistics

  Analyzed 9 months ago

A generic, language-neutral framework for extending Ruby objects with linguistic methods.

13.2K lines of code

1 current contributor

about 1 year since last commit

4 users on Open Hub

Activity Not Available
0.0
 

Treex - NLP Framework

  Analyzed 10 months ago

Treex (formerly TectoMT) is a highly modular NLP software system implemented in the Perl programming language under Linux. It is primarily aimed at machine translation, making use of the ideas and technology created during the Prague Dependency Treebank project. At the same time, it is also hoped to significantly facilitate and accelerate the development of software solutions for many other NLP tasks, especially through the reusability of its numerous integrated processing modules (called blocks), which are equipped with uniform object-oriented interfaces.

363K lines of code

22 current contributors

10 months since last commit

4 users on Open Hub

Activity Not Available
5.0
 

MeCab

  Analyzed about 5 hours ago

MeCab is a fast and customizable Japanese morphological analyzer. MeCab is designed for general-purpose use and is applied to a variety of NLP tasks, such as Kana-Kanji conversion. MeCab provides parameter estimation functionality based on CRFs and HMM.

253K lines of code

0 current contributors

over 7 years since last commit

3 users on Open Hub

Inactive
0.0
 
Licenses: No declared licenses

Ruby WordNet

  Analyzed 7 months ago

Ruby-WordNet is a Ruby interface to the WordNet® Lexical Database. WordNet® is an online lexical reference system whose design is inspired by current psycholinguistic theories of human lexical memory. English nouns, verbs, adjectives and adverbs are organized into synonym sets, each representing one underlying lexical concept. Different relations link the synonym sets.

242 lines of code

0 current contributors

almost 2 years since last commit

3 users on Open Hub

Activity Not Available
0.0
 

gensim

  Analyzed about 2 months ago

Gensim is a Python library for topic modelling, document indexing and similarity retrieval with large corpora. The target audience is the natural language processing (NLP) and information retrieval (IR) community.

54.4K lines of code

92 current contributors

3 months since last commit

3 users on Open Hub

Activity Not Available
5.0
 

airhead-research

  Analyzed about 1 year ago

The S-Space Package is a collection of algorithms for building Semantic Spaces. These algorithms process text corpora and map semantic representations for words onto high-dimensional vectors. These approaches are known by many names, such as word spaces, semantic spaces, or distributed semantics. The research and development is being done by the Natural Language Processing group at UCLA led by David Jurgens and Keith Stevens, under the advisory of Dr. Michael Dyer. Our initial goal is to provide a uniform implementation for many common semantic space algorithms in order to facilitate research ...

402K lines of code

0 current contributors

about 2 years since last commit

3 users on Open Hub

Activity Not Available
5.0
 

ClearTK

  Analyzed 7 months ago

ClearTK is a toolkit developed at the Center for Computational Language and Education Research (CLEAR) at the University of Colorado at Boulder. ClearTK provides a framework for developing statistical natural language processing components in Java. It is based on the Apache UIMA framework for text analysis, and provides: a rich feature extraction library; a common interface and wrappers for popular machine learning libraries based on models such as maximum entropy, support vector machines and conditional random fields; infrastructure for creating NLP components such as sequential taggers, chunkers, syntactic parsers, semantic role labeling, temporal resolution, etc.; collection readers for commonly used corpora; and wrappers for common NLP components such as the Snowball stemmer and OpenNLP sy...

55.7K lines of code

4 current contributors

8 months since last commit

3 users on Open Hub

Activity Not Available
0.0
 

Cascading

  Analyzed 10 months ago

Cascading is a feature-rich API for defining and executing complex, fault-tolerant data processing workflows on a Hadoop cluster.

106K lines of code

0 current contributors

over 3 years since last commit

2 users on Open Hub

Activity Not Available
0.0
 

matxin

  Analyzed about 2 months ago

Machine translation engine based on a dependency grammar and XML interchange format. The Spanish-Basque (es-eu) translation direction is currently supported.

3.4M lines of code

5 current contributors

4 months since last commit

2 users on Open Hub

Activity Not Available
5.0
 