56
I Use This!
Very High Activity
Analyzed 1 day ago. based on code collected 1 day ago.

Project Summary

Apache Spark is an open source cluster computing system that aims to make data analytics fast — both fast to run and fast to write.

To run programs faster, Spark provides primitives for in-memory cluster computing: your job can load data into memory and query it repeatedly more rapidly than with disk-based systems like Hadoop.

To make programming faster, Spark offers high-level APIs in Scala, Java and Python, letting you manipulate distributed datasets like local collections. You can also use Spark interactively to query big data from the Scala or Python shells.

Spark integrates closely with Hadoop to run inside Hadoop clusters and can access any existing Hadoop data source.

Tags

apache bigdata cluster clustercomputing distributed distributed_computing ec2 graph_computing hadoop hdfs in_memory java machine_learning mapreduce ml python scala sql streaming streamingdata

Apache License 2.0
Permitted

Commercial Use

Modify

Distribute

Place Warranty

Sub-License

Private Use

Use Patent Claims

Forbidden

Hold Liable

Use Trademarks

Required

Include Copyright

State Changes

Include License

Include Notice

These details are provided for information only. No information here is legal advice and should not be used as such.

Project Security

Vulnerabilities per Version ( last 10 releases )

There are no reported vulnerabilities

Project Vulnerability Report

Security Confidence Index

Poor security track-record
Favorable security track-record

Vulnerability Exposure Index

Many reported vulnerabilities
Few reported vulnerabilities

Did You Know...

  • ...
    65% of companies leverage OSS to speed application development in 2016
  • ...
    learn about Open Hub updates and features on the Open Hub blog
  • ...
    there are over 3,000 projects on the Open Hub with security vulnerabilities reported against them
  • ...
    you can subscribe to e-mail newsletters to receive update from the Open Hub blog
About Project Security

Languages

Scala
67%
Python
19%
Java
7%
12 Other
7%

30 Day Summary

Aug 31 2025 — Sep 30 2025

12 Month Summary

Sep 30 2024 — Sep 30 2025
  • 3311 Commits
    Down -717 (17%) from previous 12 months
  • 308 Contributors
    Down -4 (1%) from previous 12 months

Ratings

8 users rate this project:
5.0
 
5.0/5.0
Click to add your rating
  
Review this Project!