Projects tagged ‘mapreduce’

Apache CouchDB

Claimed by Apache Software Foundation Analyzed 1 day ago

CouchDb is a distributed document database system with bi-directional replication. It makes it simple to build collaborative applications that can be replicated offline by users, with full interactivity (query, add, update, delete), and later "synced up" with everyone else's changes when back online.

124K lines of code

63 current contributors

over 1 year since last commit

119 users on Open Hub

Very Low Activity

0 Reviews

I Use This

Mostly written in Erlang

Licenses: apache_2

Apache Spark

Claimed by Apache Software Foundation Analyzed 1 day ago

Apache Spark is an open source cluster computing system that aims to make data analytics fast — both fast to run and fast to write. To run programs faster, Spark provides primitives for in-memory cluster computing: your job can load data into memory and query it repeatedly more rapidly than with ... [More]

1.52M lines of code

374 current contributors

1 day since last commit

56 users on Open Hub

Very High Activity

0 Reviews

I Use This

Mostly written in Scala

Licenses: apache_2

Tags apache bigdata cluster clustercomputing distributed distributed_computing ec2 graph_computing hadoop hdfs in_memory java 8 more...

Apache Mahout

Claimed by Apache Software Foundation Analyzed about 23 hours ago

Apache Mahout's goal is to build scalable machine learning libraries. With scalable we mean: Scalable to reasonably large data sets. Our core algorithms for clustering, classfication and batch based collaborative filtering are implemented on top of Apache Hadoop using the map/reduce paradigm. ... [More]

146K lines of code

0 current contributors

over 1 year since last commit

25 users on Open Hub

Very Low Activity

0 Reviews

I Use This

Mostly written in Java

Licenses: apache_2

Tags algorithms classifiers clustering collaborative_filtering data_mining datamining dimension_reduction distributed distributed_computing hadoop java library 5 more...

Apache Hive

Claimed by Apache Software Foundation No analysis available

Hive is a data warehouse infrastructure built on top of Hadoop that provides tools to enable easy data summarization, adhoc querying and analysis of large datasets data stored in Hadoop files. It provides a mechanism to put structure on this data and it also provides a simple query language called ... [More]

0 lines of code

0 current contributors

0 since last commit

23 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in language not available

Licenses: apache_2

Tags apache bigdata cluster clustercomputing distributed_computing hadoop hdfs java mapreduce orc spark sql 4 more...

riak

R

Analyzed about 1 year ago

Riak combines a decentralized key-value store, a flexible map/reduce engine, and a friendly HTTP/JSON query interface to provide a database ideally suited for Web applications.

208K lines of code

0 current contributors

about 2 years since last commit

19 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in Erlang

Licenses: apache_2

Tags consistent-hashing decentralized erlang http javascript json key-value mapreduce

hazelcast

Analyzed about 6 hours ago

Hazelcast is a clustering and highly scalable data distribution platform for Java. Features: Distributed implementations of java.util.{Queue, Set, List, Map} Distributed implementation of java.util.concurrency.locks.Lock Distributed implementation of java.util.concurrent.ExecutorService ... [More]

1.52M lines of code

66 current contributors

about 12 hours since last commit

15 users on Open Hub

High Activity

0 Reviews

I Use This

Mostly written in Java

Licenses: apache_2

Tags cache cloud cluster datagrid dht distributed distributed_computing distributedsystems ehcache elastic grid gridcomputing 8 more...

Infinispan

Analyzed 1 day ago

Infinispan is an open source, JVM based data grid platform. Infinispan is a high performance, distributed and highly concurrent data structure. Also supports JTA transactions, eviction, and passivation/overflow to external storage.

705K lines of code

36 current contributors

1 day since last commit

11 users on Open Hub

High Activity

0 Reviews

I Use This

Mostly written in Java

Licenses: apache_2, lgpl21

Tags caching cloud clustering datagrid distributed distributedcomputing distributedsystems distribution elastic grid gridcomputing hibernate 8 more...

ScUtil

Analyzed 1 day ago

Hundreds of functions of a variety of topics, from statistics to string parsing, module utilities to network tools. Everyone's pet library accumulates features over time. My erlang library got big, fast. I often find myself giving functions from it out to other people, and a lot of my other ... [More]

9.39K lines of code

0 current contributors

over 9 years since last commit

11 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in Erlang

Licenses: mit

Tags algorithms bayesian bayesianinference bayesianmodelling concurrency conversion correlation development dispatch distributed distributed_computing distributedcomputing 40 more...

Apache Flink

Claimed by Apache Software Foundation Analyzed 1 day ago

Apache Flink is a framework and distributed processing engine for stateful computations over unbounded and bounded data streams. Flink has been designed to run in all common cluster environments, perform computations at in-memory speed and at any scale. Learn more about Flink at http://flink.apache.org/

2.12M lines of code

323 current contributors

1 day since last commit

9 users on Open Hub

Very High Activity

0 Reviews

I Use This

Mostly written in Java

Licenses: apache_2

Tags apache bigdata cluster distributed hadoop java machinelearning mapreduce scala streaming

Lokad.Cloud - O/C mapper for Azure

No analysis available

O/C mapper (object to cloud). Leverage Windows Azure without getting dragged down by low level technicalities. Key features * Queue Services as a scalable equivalent of Windows Services. * Scheduled Services as a cloud equivalent of the task scheduler. * Strong-typed blob I/O. * Scalable logs ... [More]

0 lines of code

0 current contributors

0 since last commit

5 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in language not available

Licenses: BSD-3-Clause

Tags azure cloud cloudcomputing csharp dotnet framework library lokad mapreduce scalability

Tags : Browse Projects