Projects tagged ‘datamining’

Apache Mahout

Claimed by Apache Software Foundation

Analyzed about 22 hours ago

Apache Mahout's goal is to build scalable machine learning libraries. By scalable we mean scalable to reasonably large data sets. Our core algorithms for clustering, classification, and batch-based collaborative filtering are implemented on top of Apache Hadoop using the map/reduce paradigm. ...

146K lines of code

0 current contributors

2 months since last commit

25 users on Open Hub

Low Activity

0 Reviews

Mostly written in Java

Licenses: apache_2

SpagoBI

Claimed by The OW2 Consortium

No analysis available

SpagoBI is an integration platform focused on business intelligence needs at the enterprise level. It is a fully open source solution, with no separate professional edition. SpagoBI offers a complete analytical layer: reporting, OLAP, data mining, dashboards, free and visual data inquiring, and GIS. It is built on a ...

0 lines of code

0 current contributors

0 since last commit

9 users on Open Hub

Activity Not Available

0 Reviews

Primary language: not available

Licenses: mozilla_p...

Tags bi dashbord datamart datamining datawarehouse gis olap qbe reporting

lava-server

Claimed by Linaro

No analysis available

Linaro Automated Validation Architecture server, including lava-scheduler and lava-dashboard.

0 lines of code

32 current contributors

0 since last commit

4 users on Open Hub

Activity Not Available

0 Reviews

Primary language: not available

Licenses: AGPL3

Tags arm automation bootloader continuous_integration datamining django graphing ipxe lava linaro linaro_lava linux 8 more...

Crab - Scikit-Recommender

Analyzed about 16 hours ago

Crab is a flexible, fast recommender engine for Python that integrates classic information filtering recommendation algorithms into the world of scientific Python packages (NumPy, SciPy, Matplotlib). The engine aims to provide a rich set of components from which you can construct a customized ...

4.21K lines of code

0 current contributors

about 12 years since last commit

2 users on Open Hub

Inactive

0 Reviews

Mostly written in Python

Licenses: BSD-3-Clause

Tags ai algorithms collaborative_filtering collectiveintelligence data_mining datamining distributed evaluation library machine_learning machinelearning plugable 8 more...

SOFA Statistics

Analyzed 1 day ago

SOFA is a statistics, analysis, and reporting program with an emphasis on ease of use, learn as you go, and beautiful output.

37K lines of code

0 current contributors

3 months since last commit

1 user on Open Hub

Very Low Activity

0 Reviews

Mostly written in Python

Licenses: No declared licenses

Tags database datamining python reporting statisticalanalysis statistics

Sally Tool

Analyzed about 23 hours ago

Sally is a small tool for mapping a set of strings to a set of vectors. This mapping is referred to as embedding and allows for applying techniques of machine learning and data mining for analysis of string data. Sally implements a standard technique for mapping strings to a vector space that is ...

5.62K lines of code

1 current contributor

almost 5 years since last commit

1 user on Open Hub

Inactive

0 Reviews

Mostly written in C

Licenses: gpl3

Tags datamining machinelearning strings vectorspace

jMotif

Analyzed about 20 hours ago

jMotif implements in Java a number of methods for time series data handling and analysis:

* Z normalization of time series
* Piecewise Aggregate Approximation (PAA) of time series
* Symbolic Aggregate Approximation (SAX) of time series
* iSAX (indexed SAX) in order to help one leverage the symbolic ...

4.3K lines of code

0 current contributors

over 2 years since last commit

1 user on Open Hub

Inactive

0 Reviews

Mostly written in Java

Licenses: gpl

Tags anomaly_detection behavior_analysis clustering datamining distance java kdd metrics paa patterns sax search 5 more...

refine-client-py

Analyzed about 23 hours ago

The Google Refine Python Client Library provides an interface for communicating with a Google Refine server.

1.41K lines of code

0 current contributors

over 9 years since last commit

1 user on Open Hub

Inactive

0 Reviews

Mostly written in Python

Licenses: No declared licenses

Tags datamining etl gridworks python python27 refine

Knowing Datamining

Analyzed about 8 hours ago

Datamining framework based on WEKA and Akka.

8.58K lines of code

0 current contributors

over 9 years since last commit

1 user on Open Hub

Inactive

0 Reviews

Mostly written in Scala

Licenses: apache_2

Tags data datamining eclipse fault-tolerance faulttolerance fault-tolerant framework java modular osgi scala weka

ELKI

Analyzed about 7 hours ago

ELKI: "Environment for Developing KDD-Applications Supported by Index-Structures" is a development framework for data mining algorithms written in Java. It includes a large variety of popular data mining algorithms, distance functions and index structures. Its focus is particularly on clustering ...

214K lines of code

2 current contributors

3 days since last commit

1 user on Open Hub

Moderate Activity

0 Reviews

Mostly written in Java

Licenses: AGPL3_or_...

Tags algorithms analysis api clustering data data_analysis data_mining datamining dataminingframework java kdd knowledge_discovery 8 more...
