Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

MADlib

Compare

  Analyzed about 17 hours ago

MADlib® is an open-source library for scalable in-database analytics. It provides data-parallel implementations of mathematical, statistical and machine learning methods for structured and unstructured data.

119K lines of code

0 current contributors

about 1 year since last commit

2 users on Open Hub

Very Low Activity
4.0
   
I Use This

BioMart

Compare

  No analysis available

BioMart is a query-oriented data management system developed jointly by the European Bioinformatics Institute (EBI) and Cold Spring Harbor Laboratory (CSHL). The system can be used with any type of data and comes with a range of query interfaces and administration tools, including 'out of the ... [More] box' website that can be installed, configured and customised according to requirements. The system simplifies the task of creation and maintenance of advanced query interfaces backed by a relational database and it is particularly suited for providing the 'data mining' like searches of complex descriptive (e.g. biological) data. BioMart can work with existing data repositories by converting them to a required BioMart format as well as newly created databases. [Less]

0 lines of code

0 current contributors

0 since last commit

2 users on Open Hub

Activity Not Available
4.0
   
I Use This
Mostly written in language not available
Licenses: lgpl

Cascading

Compare

  No analysis available

Cascading is a feature rich API for defining and executing complex and fault tolerant data processing workflows on a Hadoop cluster.

0 lines of code

0 current contributors

0 since last commit

2 users on Open Hub

Activity Not Available
0.0
 
I Use This
Mostly written in language not available
Licenses: apache_2

Java Data Mining Package (JDMP)

Compare

  Analyzed about 24 hours ago

The Java Data Mining Package (JDMP) is an open source Java library for data analysis and machine learning. It facilitates the access to data sources and machine learning algorithms (e.g. clustering, regression, classification, graphical models, optimization) and provides visualization modules. It ... [More] includes a matrix library for storing and processing any kind of data, with the ability to handle very large matrices even when they do not fit into memory. Import and export interfaces are provided for JDBC data bases, TXT, CSV, Excel, Matlab, Latex, MTX, HTML, WAV, BMP and other file formats. JDMP provides a number of algorithms and tools, but also interfaces to other machine learning and data mining packages (Weka, LibSVM, Mallet, Lucene, Octave). [Less]

40.7K lines of code

0 current contributors

over 8 years since last commit

2 users on Open Hub

Inactive
0.0
 
I Use This

Crab - Scikit-Recommender

Compare

  Analyzed about 11 hours ago

Crab is a flexible, fast recommender engine for Python that integrates classic information filtering recommendation algorithms in the world of scientific Python packages (NumPy,SciPy, Matplotlib). The engine aims to provide a rich set of components from which you can construct a customized ... [More] recommender system from a set of algorithms. It is designed for scability, flexibility and performance making use of scientific optimized python packages in order to provide simple and efficient solutions that are acessible to everybody and reusable in various contexts: science and engineering. [Less]

4.21K lines of code

0 current contributors

about 12 years since last commit

2 users on Open Hub

Inactive
5.0
 
I Use This

Sumatra

Compare

  Analyzed about 8 hours ago

Sumatra is a tool for managing and tracking projects based on numerical simulation or analysis, with the aim of supporting reproducible research. It can be thought of as an automated electronic lab notebook for simulation/analysis projects.

14.3K lines of code

3 current contributors

over 3 years since last commit

1 users on Open Hub

Inactive
0.0
 
I Use This
Licenses: No declared licenses

Apache PredictionIO

Compare

  No analysis available

Apache PredictionIO is an open source machine learning server. It enables developers and data engineers to build smarter web and mobile applications through a simple set of APIs. Admin UI is provided for developers to select and tune algorithms. Some benefits of using Apache PredictionIO: - ... [More] create predictive features quickly with built-in algorithms. - build your own ML algorithms on top of a state-of-the-art infrastructure. - find the best algorithm for your application. - handle big data well - PredictionIO is very scalable. - serve real-time prediction queries through robust APIs and SDKs. [Less]

0 lines of code

11 current contributors

0 since last commit

1 users on Open Hub

Activity Not Available
5.0
 
I Use This
Mostly written in language not available
Licenses: apache_2

MinorThird

Compare

  Analyzed about 19 hours ago

MinorThird is an SDK/API for machine learning and information extraction, primarily on text data. A range of algorithms are included and is integrated tightly with the visualization tools for manual and automatic annotation of text.

61.1K lines of code

0 current contributors

about 13 years since last commit

1 users on Open Hub

Inactive
0.0
 
I Use This
Licenses: No declared licenses

MLPACK C++ machine learning library

Compare

  Analyzed 4 months ago

MLPACK is a fast C++ machine learning library with an emphasis on scalability, speed, and ease-of-use. Its aim is to make machine learning possible for novice users by means of a simple, consistent API, while simultaneously exploiting C++ language features to provide maximum performance and maximum ... [More] flexibility for expert users. It contains algorithms such as k-means, Gaussian mixture models, hidden Markov models, density estimation trees, kernel PCA, locality-sensitive hashing, sparse coding, linear regression, least-angle regression, etc. [Less]

276K lines of code

71 current contributors

4 months since last commit

1 users on Open Hub

Activity Not Available
0.0
 
I Use This
Licenses: No declared licenses

Java Machine Learning Library

Compare

  Analyzed about 16 hours ago

The Java Machine Learning Library is a set of reference implementations of machine learning algorithms. These algorithms are well documented, both in the source code as on the documentation site. Besides real machine learning algorithms also a lot of supporting classes are provided: distance ... [More] measures, evaluation criteria, datasets for validation purposes and some sample code. Currently the library contains clustering algorithms, distance measures, wavelet transforms, fourier transforms, matrices, support vector machines and some other algorithms [Less]

19.9K lines of code

0 current contributors

about 11 years since last commit

1 users on Open Hub

Inactive
5.0
 
I Use This