AWS Certified Solutions Architect – Professional — Question 105
You require the ability to analyze a large amount of data, which is stored on Amazon S3 using Amazon Elastic Map Reduce. You are using the cc2 8xlarge instance type, whose CPUs are mostly idle during processing. Which of the below would be the most cost efficient way to reduce the runtime of the job?
Answer options
- A. Create more, smaller flies on Amazon S3.
- B. Add additional cc2 8xlarge instances by introducing a task group.
- C. Use smaller instances that have higher aggregate I/O performance.
- D. Create fewer, larger files on Amazon S3.
Correct answer: C
Explanation
The correct answer is C because using smaller instances with better aggregate I/O performance can improve data processing efficiency, making better use of resources. Options A and D do not address the need for improved I/O performance, while B would not be cost-efficient as adding more cc2 8xlarge instances may not solve the underlying issue of underutilization.