Apache Spark

By
Apache Software Foundation
v
Distributed, in-memory data analytics and ML.
Apache Spark
From
Vendor
Apache Software Foundation
Version

Features

In-memory big data engine

Supports batch, streaming, MLlib, SQL, GraphX

Works with Hadoop, cloud, Mesos, K8s

Optimized query execution

Large ecosystem of connectors

Apache Spark
What is Apache Spark?
Scalable, high-speed analytics engine.
Key Features
* In-memory
* MLlib
* GraphX
Use Cases
* ETL
* Batch & stream analytics
* AI pipelines