AWS Certified Machine Learning Engineer – Associate (MLA-C01) — Question 68

An ML engineer is using Amazon SageMaker to train a deep learning model that requires distributed training. After some training attempts, the ML engineer observes that the instances are not performing as expected. The ML engineer identifies communication overhead between the training instances.
What should the ML engineer do to MINIMIZE the communication overhead between the instances?

Answer options

Correct answer: C

Explanation

The correct answer is C because placing the instances in the same VPC subnet and storing the data in the same AWS Region and Availability Zone minimizes latency and communication overhead. Options A and B suggest storing data in a different AWS Region, which would increase communication delays. Option D, while in the same Region, still places the data in a different Availability Zone, leading to potential overhead issues.