Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

WEKA

Compare

  Analyzed 12 months ago

707K lines of code

3 current contributors

about 1 year since last commit

37 users on Open Hub

Activity Not Available
3.93333
   
I Use This
Licenses: No declared licenses

YALE -- Open-Source Java Data Mining

Compare

  Analyzed over 1 year ago

YALE (Yet Another Learning Environment) is the most comprehensive open-source software for intelligent data analysis, data mining, knowledge discovery, machine learning, predictive analytics, forecasting, and analytics in business intelligence (BI). YALE provides more than 400 data mining operators ... [More] , a graphical user interface (GUI), an online tutorial with hands-on data mining applications, a comprehensive PDF tutorial, many visualization schemes for data sets and data mining results, many different learning and meta-learning schemes ranging from decision tree and rule learners to neural networks, SVMs, ensemble methods, etc. YALE is implemented in Java and available under GPL (GNU General Public License) as well as under a developer license (OEM license) for closed-source developers. [Less]

3.53M lines of code

3 current contributors

over 2 years since last commit

17 users on Open Hub

Activity Not Available
4.25
   
I Use This
Licenses: No declared licenses

gCube

Compare

  Analyzed 3 months ago

gCube is a framework dedicated to scientists. It enables the declarative and interactive creation of transient Virtual Research Environments that aggregate and deploy on-demand content resources and application services by exploiting computational and storage resources offered by private and commercial cloud providers.

22.7M lines of code

3 current contributors

3 months since last commit

11 users on Open Hub

Activity Not Available
5.0
 
I Use This

OpenRefine

Compare

  Analyzed 3 days ago

OpenRefine is a free, open source power tool for working with messy data and improving it

-10 lines of code

10 current contributors

3 months since last commit

4 users on Open Hub

Low Activity
5.0
 
I Use This

SimMetrics

Compare

  Analyzed about 2 months ago

SimMetrics is a Similarity Metric Library, e.g. from edit distance's (Levenshtein, Gotoh, Jaro etc) to other metrics, (e.g Soundex, Chapman). Work provided by UK Sheffield University funded by (AKT) an IRC sponsored by EPSRC, grant number GR/N15764/01.

5.76K lines of code

0 current contributors

over 10 years since last commit

2 users on Open Hub

Activity Not Available
5.0
 
I Use This
Licenses: No declared licenses

Java Data Mining Package (JDMP)

Compare

  Analyzed 3 days ago

The Java Data Mining Package (JDMP) is an open source Java library for data analysis and machine learning. It facilitates the access to data sources and machine learning algorithms (e.g. clustering, regression, classification, graphical models, optimization) and provides visualization modules. It ... [More] includes a matrix library for storing and processing any kind of data, with the ability to handle very large matrices even when they do not fit into memory. Import and export interfaces are provided for JDBC data bases, TXT, CSV, Excel, Matlab, Latex, MTX, HTML, WAV, BMP and other file formats. JDMP provides a number of algorithms and tools, but also interfaces to other machine learning and data mining packages (Weka, LibSVM, Mallet, Lucene, Octave). [Less]

40.7K lines of code

0 current contributors

almost 2 years since last commit

2 users on Open Hub

Very Low Activity
0.0
 
I Use This

MyMediaLite

Compare

  Analyzed 3 days ago

MyMediaLite is a recommender system algorithm library. It provides methods for two common tasks in recommender systems/collaborative filtering: rating prediction and item prediction from implicit feedback. MyMediaLite also contains command-line programs that let you use much of the library's functionality without having to program.

174K lines of code

3 current contributors

3 months since last commit

1 users on Open Hub

Low Activity
5.0
 
I Use This

dishevelled

Compare

  Analyzed 10 months ago

dishevelled.org hosts Free and Open Source libraries for various user interface components and supporting code, with emphasis on views and editors for complex data structures, like collections, sets, lists, maps, graphs, and matrices.

174K lines of code

1 current contributors

almost 2 years since last commit

1 users on Open Hub

Activity Not Available
0.0
 
I Use This

SOOT

Compare

  Analyzed 4 days ago

The Perl wrapper for CERN's ROOT library, a comprehensive data analysis framework. SOOT is very similar to the Ruby-ROOT or PyROOT extensions for their respective languages. Specifically, the first revision of SOOT was implemented after the model of Ruby-ROOT. SOOT uses a very dynamic approach ... [More] to wrapping a very large and quickly evolving library. Due to the dynamic nature (using the CInt introspection), SOOT is able to handle most of the ROOT classes without explicitly wrapping them. [Less]

115K lines of code

1 current contributors

9 months since last commit

1 users on Open Hub

Very Low Activity
5.0
 
I Use This

ELKI

Compare

  Analyzed 2 days ago

ELKI: "Environment for Developing KDD-Applications Supported by Index-Structures" is a development framework for data mining algorithms written in Java. It includes a large variety of popular data mining algorithms, distance functions and index structures. Its focus is particularly on clustering ... [More] and outlier detection methods, in contrast to many other data mining toolkits that focus on classification. Additionally, it includes support for index structures to improve algorithm performance such as R*-Tree and M-Tree. The modular architecture is meant to allow adding custom components such as distance functions or algorithms, while being able to reuse the other parts for evaluation. [Less]

186K lines of code

8 current contributors

4 months since last commit

1 users on Open Hub

Moderate Activity
5.0
 
I Use This