AWS Certified Solutions Architect – Professional (SAP-C02) — Question 266

A company has built a high performance computing (HPC) cluster in AWS for a tightly coupled workload that generates a large number of shared files stored in Amazon EFS. The cluster was performing well when the number of Amazon EC2 instances in the cluster was 100. However, when the company increased the cluster size to 1.000 EC2 instances, overall performance was well below expectations.

Which collection of design choices should a solutions architect make to achieve the maximum performance from the HPC cluster? (Choose three.)

Answer options

Correct answer: A, C, F

Explanation

To maximize performance for a tightly coupled HPC workload scaled to 1,000 instances, launching the cluster in a single Availability Zone minimizes network latency, while utilizing Elastic Fabric Adapter (EFA) provides high-bandwidth, low-latency inter-node communication. Replacing Amazon EFS with Amazon FSx for Lustre is critical because FSx for Lustre is specifically designed for high-performance, sub-millisecond, and highly concurrent file access required by massive HPC clusters. Other options, like multiple AZs or EBS RAID arrays, either increase latency or fail to provide a distributed, concurrent shared file system for 1,000 instances.