Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

deeplearning4j

Compare

  Analyzed about 6 hours ago

Deeplearning4j is the first commercial-grade, open-source, distributed deep-learning library; designed to be used in business environments. Deeplearning4j aims to be cutting-edge plug and play, more convention than configuration, which allows for fast prototyping for non-researchers. Vast ... [More] support of scale out: Hadoop, Spark and Akka + AWS et al It includes both a distributed, multi-threaded deep-learning framework and a normal single-threaded deep-learning framework. Iterative reduce net training. First framework adapted for a micro-service architecture. A versatile n-dimensional array class. GPU integration [Less]

1.1M lines of code

17 current contributors

4 months since last commit

5 users on Open Hub

Very Low Activity
4.0
   
I Use This

Apache Ignite

Compare

Claimed by Apache Software Foundation Analyzed about 18 hours ago

Apache Ignite In-Memory Data Fabric is a high-performance, integrated and distributed in-memory platform for computing and transacting on large-scale data sets in real-time, orders of magnitude faster than possible with traditional disk-based or flash technologies.

1.51M lines of code

0 current contributors

2 days since last commit

4 users on Open Hub

High Activity
0.0
 
I Use This

Facebook Presto

Compare

Claimed by Facebook Analyzed about 12 hours ago

Distributed SQL query engine for big data Presto is an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes ranging from gigabytes to petabytes. Presto was designed and written from the ground up for interactive analytics and ... [More] approaches the speed of commercial data warehouses while scaling to the size of organizations like Facebook. Presto is a distributed SQL query engine optimized for ad-hoc analysis at interactive speed. It supports standard ANSI SQL, including complex queries, aggregations, joins, and window functions. [Less]

2.65M lines of code

125 current contributors

4 days since last commit

4 users on Open Hub

Very High Activity
0.0
 
I Use This

Apache Airavata

Compare

Claimed by Apache Software Foundation Analyzed about 3 hours ago

Apache Airavata is a software toolkit currently used to build science gateways but that has a much wider potential use. It provides features to compose, manage, execute, and monitor small to large scale applications and workflows on computational resources ranging from local clusters to national ... [More] grids and computing clouds. Gadgets interfaces to Airavata back end services can be deployed in open social containers such as Apache Rave and modify them to suit their needs. Airavata builds on general concepts of service oriented computing, distributed messaging, and workflow composition and orchestration. [Less]

2.78M lines of code

15 current contributors

15 days since last commit

4 users on Open Hub

Moderate Activity
0.0
 
I Use This

StreamSets Data Collector

Compare

Claimed by StreamSets No analysis available

Open source software for the rapid development and ​reliable​ operation of complex data flows.

0 lines of code

60 current contributors

0 since last commit

4 users on Open Hub

Activity Not Available
5.0
 
I Use This
Mostly written in language not available
Licenses: apache_2

Apache Flume

Compare

Claimed by Apache Software Foundation Analyzed about 7 hours ago

Apache Flume is a system for reliably collecting high-throughput data from streaming data sources like logs.

83.7K lines of code

3 current contributors

25 days since last commit

4 users on Open Hub

Low Activity
0.0
 
I Use This

snowplow

Compare

  Analyzed about 21 hours ago

Code base for computer science projects.

5.14K lines of code

15 current contributors

24 days since last commit

3 users on Open Hub

Moderate Activity
5.0
 
I Use This

Apache Giraph

Compare

Claimed by Apache Software Foundation Analyzed 4 months ago

Giraph builds upon the graph-oriented nature of Pregel but additionally adds fault-tolerance to the coordinator process with the use of ZooKeeper as its centralized coordination service. Its implemented a graph-processing framework that is launched as a typical Hadoop job to leverage existing ... [More] Hadoop infrastructure, such as Amazon's EC2. Giraph follows the bulk-synchronous parallel model relative to graphs where vertices can send messages to other vertices during a given superstep. [Less]

141K lines of code

5 current contributors

about 2 years since last commit

3 users on Open Hub

Activity Not Available
0.0
 
I Use This

Crate Data

Compare

Claimed by Crate.IO Analyzed 1 day ago

A massively scalable SQL data store. Zero administration required.

566K lines of code

30 current contributors

2 days since last commit

3 users on Open Hub

Very High Activity
5.0
 
I Use This

HPCC Systems

Compare

  Analyzed 1 day ago

HPCC (High Performance Computing Cluster) is an open source, massive parallel-processing computing platform that solves Big Data problems. The HPCC Systems architecture incorporates the Thor and Roxie clusters as well as common middleware components, an external communications layer, client ... [More] interfaces which provide both end-user services and system management tools, and auxiliary components to support monitoring and to facilitate loading and storing of filesystem data from external sources. An HPCC environment can include only Thor clusters, or both Thor and Roxie clusters. [Less]

1.67M lines of code

33 current contributors

1 day since last commit

3 users on Open Hub

Very High Activity
5.0
 
I Use This
Licenses: apache_2, Creative_...