What does this Amazon Web Services Solution do?
JuiceFS is a distributed shared file system. It provides a standard, flexible, fully managed storage service for the Hadoop ecosystem, enabling big data platforms to maximize performance in the cloud. In an EMR environment, it supports almost all computing engines and is fully compatible with HDFS. By combining its own metadata service with Amazon S3, JuiceFS can ensure data consistency and provide better read and write performance, especially in ETL and data analysis scenarios that use the Parquet and ORC columnar storage formats.
Amazon Web Services Solution overview
This solution lets you get started with JuiceFS quickly and learn how to use it as the storage backend for Amazon EMR. You can also run a performance test with the script included in the solution.
Amazon EMR with JuiceFS
Last updated: 01/2021
Author: Amazon Web Services
Estimated deployment time: 20 min