AWS Certified Solutions Architect – Professional — Question 437

Your company is storing millions of sensitive transactions across thousands of 100-GB files that must be encrypted in transit and at rest. Analysts concurrently depend on subsets of files, which can consume up to 5 TB of space, to generate simulations that can be used to steer business decisions.
You are required to design an AWS solution that can cost effectively accommodate the long-term storage and in-flight subsets of data.
Which approach can satisfy these objectives?

Answer options

Correct answer: A

Explanation

Amazon S3 provides the most cost-effective and durable long-term storage for large-scale datasets, and it natively supports server-side encryption. For processing 5 TB subsets, using EC2 ephemeral drives (instance store) is highly cost-effective because it avoids the premium cost of high-memory instances required for in-memory processing. In contrast, maintaining an active Amazon EMR cluster for HDFS or using provisioned EBS volumes for the entire dataset would be significantly more expensive.