Audience

Organizations that want a unified analytics engine for large-scale data processing

About Apache Spark

Apache Spark™ is a unified analytics engine for large-scale data processing. Apache Spark achieves high performance for both batch and streaming data, using a state-of-the-art DAG scheduler, a query optimizer, and a physical execution engine. Spark offers over 80 high-level operators that make it easy to build parallel apps. And you can use it interactively from the Scala, Python, R, and SQL shells. Spark powers a stack of libraries including SQL and DataFrames, MLlib for machine learning, GraphX, and Spark Streaming. You can combine these libraries seamlessly in the same application. Spark runs on Hadoop, Apache Mesos, Kubernetes, standalone, or in the cloud. It can access diverse data sources. You can run Spark using its standalone cluster mode, on EC2, on Hadoop YARN, on Mesos, or on Kubernetes. Access data in HDFS, Alluxio, Apache Cassandra, Apache HBase, Apache Hive, and hundreds of other data sources.

Pricing

Free Version:
Free Version available.

Integrations

Ratings/Reviews

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Company Information

Apache Software Foundation
Founded: 1999
United States
spark.apache.org

Videos and Screen Captures

Apache Spark Screenshot 1
Other Useful Business Software
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
Try Free

Product Details

Platforms Supported
Cloud
Training
Documentation

Apache Spark Frequently Asked Questions

Q: What kinds of users and organization types does Apache Spark work with?
Q: What languages does Apache Spark support in their product?
Q: What other applications or services does Apache Spark integrate with?
Q: What type of training does Apache Spark provide?

Apache Spark Product Features

Big Data

Templates
Data Visualization
Collaboration
Data Blends
Data Cleansing
Data Warehousing
High Volume Processing
Data Mining
No-Code Sandbox
Predictive Analytics

Data Analysis

Data Visualization
Text Analytics
Regression Analysis
Data Discovery
Sentiment Analysis
High Volume Processing
Statistical Modeling
Predictive Analytics

Streaming Analytics

Data Enrichment
Data Wrangling / Data Prep
Multiple Data Source Support
Process Automation
Real-time Analysis / Reporting
Visualization Dashboards

Apache Spark Additional Categories