AWS Certified Data Analytics – Specialty — Question 141
A company is Running Apache Spark on an Amazon EMR cluster. The Spark job writes to an Amazon S3 bucket. The job fails and returns an HTTP 503 `Slow
Down` AmazonS3Exception error.
Which actions will resolve this error? (Choose two.)
Answer options
- A. Add additional prefixes to the S3 bucket
- B. Reduce the number of prefixes in the S3 bucket
- C. Increase the EMR File System (EMRFS) retry limit
- D. Disable dynamic partition pruning in the Spark configuration for the cluster
- E. Add more partitions in the Spark configuration for the cluster
Correct answer: A, C
Explanation
Adding additional prefixes to the S3 bucket helps distribute the load and improves performance, which can mitigate the 'Slow Down' error. Increasing the EMRFS retry limit allows the system to attempt writing again after encountering temporary issues, thereby helping to resolve the error. The other options do not address the underlying issue of request throttling effectively.