- Products›
- Amazon EMR›
- Amazon EMR features
Apache Spark on Amazon EMR
Overview
Page topics
Features and benefits
Open allEMR features Amazon EMR runtime for Apache Spark, a performance-optimized runtime environment for Apache Spark that is active by default on Amazon EMR clusters. Amazon EMR runtime for Apache Spark can be faster than clusters without the EMR runtime, and has 100% API compatibility with standard Apache Spark. This improved performance means your workloads run faster and saves you compute costs, without making any changes to your applications.
By using a directed acyclic graph (DAG) execution engine, Spark can create efficient query plans for data transformations. Spark also stores input, output, and intermediate data in-memory as resilient data frames, which allows for fast processing without I/O cost, boosting performance of iterative or interactive workloads.