AWS Certified Solutions Architect – Professional — Question 1010

A company is running a data-intensive application on AWS. The application runs on a cluster of hundreds of Amazon EC2 instances. A shared file system also runs on several EC2 instances that store 200 TB of data. The application reads and modifies the data on the shared file system and generates a report. The job runs once monthly, reads a subset of the files from the shared file system, and takes about 72 hours to complete. The compute instances scale in an Auto Scaling group, but the instances that host the shared the system run continuously. The compute and storage instances are all in the same AWS Region.
A solutions architect needs to reduce costs by replacing the shared file system instances. The file system must provide high performance access to the needed data for the duration of the 72-hour run.
Which solution will provide the LARGEST overall cost reduction while meeting these requirements?

Answer options

Correct answer: A

Explanation

Storing the 200 TB of data in Amazon S3 Intelligent-Tiering minimizes monthly storage costs for data that remains inactive for most of the month. Creating an Amazon FSx for Lustre file system on-demand using lazy loading allows the monthly job to access only the required subset of files with high performance, without incurring the time and cost of batch loading all 200 TB. Other options like EBS Multi-Attach do not support hundreds of instances, and AWS Storage Gateway cannot deliver the high-performance throughput required for a massive EC2 cluster.