Tags : Browse Projects

oryxproject

  Analyzed 1 day ago

Oryx 2 (incubating): Lambda architecture on Spark for real-time large scale machine learning

131K lines of code

1 current contributor

almost 3 years since last commit

0 users on Open Hub

Inactive
5.0

Cloudml Zen

  Analyzed 31 minutes ago

Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logistic regression, latent Dirichlet allocation, factorization machines and DNN.

15.7K lines of code

1 current contributor

over 5 years since last commit

0 users on Open Hub

Inactive
0.0

StratioSparta

  Analyzed 1 day ago

Real Time Aggregation based on Spark Streaming

51.6K lines of code

2 current contributors

over 4 years since last commit

0 users on Open Hub

Inactive
0.0
Tags: scala, spark

xgboost

  Analyzed 1 day ago

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow

146K lines of code

78 current contributors

3 days since last commit

0 users on Open Hub

High Activity
0.0

Apache Spot (Incubating)

  Analyzed 2 days ago

Apache Spot is a community-driven cybersecurity project, built from the ground up, to bring advanced analytics to all IT telemetry data on an open, scalable platform. It is open source software for leveraging insights from flow and packet analysis. Spot expedites threat detection, investigation, and remediation via machine learning and consolidates all enterprise security data into a comprehensive IT telemetry hub based on open data models. Spot’s scalability and machine learning capabilities support an ecosystem of ML-based applications that can run simultaneously on a single, shared, enriched data set to provide organizations with maximum analytic flexibility.

10.4M lines of code

3 current contributors

3 days since last commit

0 users on Open Hub

Very High Activity
0.0
Licenses: No declared licenses

Apache SystemML

Claimed by Apache Software Foundation. Analyzed about 14 hours ago

Declarative large-scale machine learning (ML) that aims at flexible specification of ML algorithms and automatic generation of hybrid runtime plans ranging from single-node, in-memory computations to distributed computations on Apache Hadoop and Apache Spark. ML algorithms are expressed in an R-like or Python-like syntax that includes linear algebra primitives, statistical functions, and ML-specific constructs. This high-level language significantly increases the productivity of data scientists as it provides (1) full flexibility in expressing custom analytics, and (2) data independence from the underlying input formats and physical data representations. Automatic optimization according to data and cluster characteristics ensures both efficiency and scalability.

1.92M lines of code

7 current contributors

8 days since last commit

0 users on Open Hub

High Activity
0.0

Distributed DataFrame

  Analyzed about 2 hours ago

Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine

20.9K lines of code

0 current contributors

almost 8 years since last commit

0 users on Open Hub

Inactive
0.0
Licenses: No declared licenses

BigInsights-on-Apache-Hadoop

  Analyzed about 17 hours ago

Example projects for 'BigInsights for Apache Hadoop' on IBM Bluemix

7.92K lines of code

0 current contributors

over 6 years since last commit

0 users on Open Hub

Inactive
0.0

movie-recommender-demo

  Analyzed about 2 hours ago

This project walks through how you can create recommendations using Apache Spark machine learning. There are a number of Jupyter notebooks that you can run on IBM Data Science Experience, and there is a live demo of a movie recommendation web application you can interact with. The demo also uses IBM Message Hub (Kafka) to push application events to a topic, where they are consumed by a Spark Streaming job running on IBM BigInsights (Hadoop).

2.51K lines of code

0 current contributors

almost 2 years since last commit

0 users on Open Hub

Very Low Activity
0.0