Apache Spark
By Apache Software Foundation
Distributed, in-memory data analytics and ML.
Vendor: Apache Software Foundation
Features
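In-memory engine for large-scale data processing
Supports batch processing, streaming, SQL, machine learning (MLlib), and graph processing (GraphX)
Works with Hadoop, Kubernetes (K8s), Mesos, and cloud platforms
Optimized query execution through the Catalyst optimizer
Large ecosystem of data source connectors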
What is Apache Spark?
Apache Spark is a scalable, high-speed engine for distributed, in-memory data analytics and machine learning.
Key Features
* In-memory processing
* MLlib (machine learning)
* GraphX (graph processing)
Use Cases
* ETL
* Batch & stream analytics
* AI pipelines
