AWS Certified Solutions Architect – Professional — Question 624

A company has a High Performance Computing (HPC) cluster in its on-premises data center, which runs thousands of jobs in parallel for one week every month, processing petabytes of images. The images are stored on a network file server, which is replicated to a disaster recovery site. The on-premises data center has reached capacity, and the company has started to spread the jobs out over the course of the month to better utilize the cluster, which delays job completion.
The company has asked its Solutions Architect to design a cost-effective solution on AWS to scale beyond the current capacity of 5,000 cores and 10 petabytes of data. The solution must require the least amount of management overhead and maintain the current level of durability.
Which solution will meet the company's requirements?

Answer options

Correct answer: C

Explanation

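AWS Batch with managed compute environments fits this batch HPC workload because the service provisions, scales, and terminates the Spot capacity itself, so the team does not have to operate a scheduler or a cluster. Spot pricing, combined with scaling down between the monthly runs, keeps compute costs low for a workload that is busy only one week per month. Storing the petabytes of raw image data in Amazon S3 maintains the durability of the replicated file server, since S3 stores objects redundantly across multiple Availability Zones and is designed for 99.999999999% durability. Each job copies the objects it needs from S3 onto an Amazon EBS volume attached to its instance for temporary processing storage and writes its results back to S3, which avoids the performance bottlenecks and high costs associated with shared file systems like Amazon EFS. The alternatives, which rely on custom ECS, EC2, or EMR scaling configurations, require significantly more management overhead and are less cost-effective.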
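
The pattern described in the explanation can be sketched in Python. This is a minimal illustration under stated assumptions, not a reference implementation: the function names `spot_compute_environment` and `process_image`, the `results/` key prefix, and the `transform` callback are all hypothetical. The dictionary keys follow the parameters of the boto3 `create_compute_environment` call for AWS Batch, and `s3` is assumed to be a boto3 S3 client (or any object offering `download_file` and `upload_file`).

```python
import os
import tempfile


def spot_compute_environment(name, max_vcpus, subnets, security_groups, instance_role):
    """Build keyword arguments for boto3's batch.create_compute_environment.

    Assumption: a managed environment on Spot capacity that may shrink to zero
    vCPUs when no jobs are queued. All argument values are supplied by the caller.
    """
    return {
        "computeEnvironmentName": name,
        "type": "MANAGED",  # AWS Batch provisions and scales the instances
        "computeResources": {
            "type": "SPOT",
            "allocationStrategy": "SPOT_CAPACITY_OPTIMIZED",
            "minvCpus": 0,  # no idle capacity between monthly runs
            "maxvCpus": max_vcpus,
            "instanceTypes": ["optimal"],
            "subnets": subnets,
            "securityGroupIds": security_groups,
            "instanceRole": instance_role,
        },
    }


def process_image(s3, bucket, key, transform, scratch_dir=None):
    """Stage one object from S3 on scratch storage, process it, write the result back.

    scratch_dir is assumed to be a directory on the instance's EBS volume;
    transform(src_path, dst_path) is a hypothetical stand-in for the real job.
    """
    with tempfile.TemporaryDirectory(dir=scratch_dir) as workdir:
        src = os.path.join(workdir, "input")
        dst = os.path.join(workdir, "output")
        s3.download_file(bucket, key, src)  # S3 -> temporary storage
        transform(src, dst)                 # the actual image processing
        result_key = "results/" + key       # hypothetical output prefix
        s3.upload_file(dst, bucket, result_key)  # temporary storage -> S3
    return result_key
```

In real use, the dictionary would be passed as `boto3.client("batch").create_compute_environment(**params)`, and `process_image` would run inside the container that each AWS Batch job launches. Because the temporary directory is removed after every object, nothing durable lives on the EBS volume, so an interrupted Spot instance only loses work that can be retried.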