AWS Certified Big Data – Specialty — Question 56

An administrator is deploying Spark on Amazon EMR for two distinct use cases: machine learning algorithms and ad-hoc querying. All data will be stored in Amazon S3. Two separate clusters for each use case will be deployed. The data volumes on Amazon S3 are less than 10 GB.
How should the administrator align instance types with the clusters purpose?

Answer options

Correct answer: A

Explanation

Option A is correct because C instance types are optimized for compute-intensive tasks like machine learning, while R instance types are suited for memory-intensive tasks like ad-hoc querying. The other options do not align the instance types with the specific requirements of the use cases, resulting in suboptimal performance.