An ML engineer is using Amazon SageMaker to train a deep learning model that requires dis…

Question

An ML engineer is using Amazon SageMaker to train a deep learning model that requires distributed training. After some training attempts, the ML engineer observes that the instances are not performing as expected. The ML engineer identifies communication overhead between the training instances.
What should the ML engineer do to MINIMIZE the communication overhead between the instances?

Accepted Answer

Correct answer: C. C. Place the instances in the same VPC subnet. Store the data in the same AWS Region and Availability Zone where the instances are deployed. — The correct answer is C because placing the instances in the same VPC subnet and storing the data in the same AWS Region and Availability Zone minimizes latency and communication overhead. Options A and B suggest storing data in a different AWS Region, which would increase communication delays. Option D, while in the same Region, still places the data in a different Availability Zone, leading to potential overhead issues.

AWS Certified Machine Learning Engineer – Associate (MLA-C01) — Question 68

Answer options

Correct answer: C

Explanation