Posted On: Sep 6, 2022

EMR Studio is an integrated development environment (IDE) that makes it easy for data scientists and data engineers to develop, visualize, and debug big data and analytics applications written in R, Python, Scala, and PySpark. Today, we are excited to announce that EMR Studio is now available in Amazon Web Services China (Beijing) region, operated by Sinnet, and Amazon Web Services China (Ningxia) region, operated by NWCD.

EMR Studio provides fully managed Jupyter Notebooks to run interactive workloads on EMR. It also provides tools like Spark UI and YARN Timeline Service to simplify debugging. Users of EMR Studio can install custom kernels and libraries, collaborate with peers using code repositories such as GitHub and BitBucket, or execute parameterized notebooks as part of scheduled workflows using orchestration services like Apache Airflow.

Administrators can set up EMR Studio such that analysts can run their applications on existing EMR clusters or create new clusters using pre-defined Amazon CloudFormation templates for EMR. EMR Studio is generally available on EMR release version 5.32.0 and 6.2.0 and later.

You can learn more by reading our Amazon EMR Studio documentation, or visiting the Amazon EMR Studio detail page.