AWS Certified Solutions Architect – Professional — Question 708

A company is using an Amazon EMR cluster to run its big data jobs. The cluster's jobs are invoked by AWS Step Functions Express Workflows that consume various Amazon Simple Queue Service (Amazon SQS) queues. The workload of this solution is variable and unpredictable. Amazon CloudWatch metrics show that the cluster's peak utilization is only 25% at times and that the cluster sits idle the rest of the time.
A solutions architect must optimize the costs of the cluster without negatively impacting the time it takes to run the various jobs.
What is the MOST cost-effective solution that meets these requirements?

Answer options

Correct answer: D

Explanation

Option D is correct because using a diversified task fleet with capacity-optimized Spot Instances allows Amazon EMR to provision capacity from the most available pools, minimizing Spot interruptions while drastically reducing costs for unpredictable workloads. Combining On-Demand and Spot Instances ensures that baseline processing is guaranteed while scaling out cheaply. Options A, B, and C are incorrect because they either introduce unnecessary costs (such as Dedicated On-Demand Instances or Reserved Instances for highly variable workloads) or degrade performance by removing task nodes entirely.