Tags : Browse Projects

IceNLP

  Analyzed 10 months ago

IceNLP is an open source Natural Language Processing (NLP) toolkit for analyzing and processing Icelandic text. The toolkit is implemented in Java.

31.2K lines of code

0 current contributors

almost 4 years since last commit

2 users on Open Hub

Activity Not Available
5.0
 
Licenses: No declared licenses

VISL Constraint Grammar Compiler

  Analyzed 10 months ago

The VISL Constraint Grammar Compiler is a natural language parser generator. It is an implementation of Pasi Tapanainen's CG-2 constraint grammar formalism. VISL CG-3 is feature-wise backwards compatible with CG-2 and VISLCG.

30.7K lines of code

3 current contributors

10 months since last commit

2 users on Open Hub

Activity Not Available
5.0
 

statgram-fr

  No analysis available

WARNING: Do not put non-GPL resources in the repository, e.g. TnT or the French Treebank.

This is a toolkit for training and evaluating statistical parsers acquired from the French Treebank. It is also a treebank manipulation toolkit designed to work specifically with the French Treebank, though it may be used as well with other treebanks such as the Penn Treebank. The goal is to prototype, from existing tools, an accurate parser for constituent and functional dependency parsing of French relying on statistical methods. There is a wiki for further information.

How to use the tools. Some bits of documentation on the tools developed so far:
  Tree manipulation
  Parsers (TODO)
  Machine Learning

Some features. Main functionalities:

Treebank query/manipulation:
  recode facilities: utilities to convert different formats into each other: French Treebank, Penn Treebank and Ims (basic)
  tgrep: quick concordances from corpora (basic)
  tsed: quick predefined transformations of the corpus (basic)
  twc: stats from corpora with outputs easy to use with e.g. R (basic)
  tdiff: find differences between trees (not implemented yet)

Reuse existing tools, e.g.:
  The Berkeley parser
  Collins/Bikel parser (not yet implemented)
  TnT
  LNCKY (CKY parser implemented by M. Johnson)
  evalb
  XLE dependency annotation/evaluation tools

Informal schedule/plan:
  Maximise the constituent parsing accuracy
  Given constituent structure, perform some functional role labelling

0 lines of code

0 current contributors

0 since last commit

0 users on Open Hub

Activity Not Available
0.0
 
Mostly written in language not available
Licenses: GPL-2.0+

albanianwordnet

  Analyzed about 1 year ago

Here at the University of Vlora (http://univlora.edu.al) we are creating a wordnet to map the structure of the Albanian language, in a similar fashion to the English WordNet developed at Princeton University. One of the goals of this project is to create an open framework on which other interesting projects may be built. Currently our team aims to build an Albanian definition-to-word mapper and a simple Albanian-English machine translation tool.

0 lines of code

0 current contributors

about 9 years since last commit

0 users on Open Hub

Activity Not Available
0.0
 
Mostly written in language not available
Licenses: artistic_gpl

chitrakavya

  Analyzed about 1 year ago

To provide a set of software/hardware solutions that simplify the use of Sanskrit on computers and let users appreciate the flexibility and richness that Sanskrit provides. These solutions will contribute to the productivity of users who wish to use the Sanskrit language on computers.

0 lines of code

0 current contributors

about 8 years since last commit

0 users on Open Hub

Activity Not Available
0.0
 
Mostly written in language not available
Licenses: LGPL

uncorpora

  Analyzed about 1 year ago

http://www.uncorpora.org is a research-oriented collection of United Nations documents. This project contains utilities and examples for processing the corpora.

1.06K lines of code

0 current contributors

almost 8 years since last commit

0 users on Open Hub

Activity Not Available
0.0
 

supertagger

  Analyzed about 1 year ago

A toolkit for CRF-based sequence labelling, developed specifically for supertagging applications.

7.69K lines of code

0 current contributors

almost 8 years since last commit

0 users on Open Hub

Activity Not Available
0.0
 

samskrit

  Analyzed about 1 year ago

Tools to explore Sanskrit as a framework and its logical system.

0 lines of code

0 current contributors

over 7 years since last commit

0 users on Open Hub

Activity Not Available
0.0
 
Mostly written in language not available
Licenses: Apache-2.0

japanese-dependency-vectors

  No analysis available

This tool creates semantic vector space models based on dependency grammar relationships. It is based on the research of Sebastian Pado and is modeled after his excellent DependencyVectors software (which focuses on English text), although it was written from scratch without using code from that project's codebase.

0 lines of code

0 current contributors

0 since last commit

0 users on Open Hub

Activity Not Available
0.0
 
Mostly written in language not available
Licenses: EPL-1.0

pythonbeta

  Analyzed about 1 year ago

An open source implementation of the Beta software. Beta, originally developed by Benny Brodda in the 1970s, can be used for corpus work such as processing and analyzing text. Briefly, Beta takes a set of rules and the text to be processed as input, and processes the text according to the rules.

468 lines of code

0 current contributors

over 8 years since last commit

0 users on Open Hub

Activity Not Available
0.0
 