A company has set up and deployed its machine learning (ML) model into production with an…

Question

A company has set up and deployed its machine learning (ML) model into production with an endpoint using Amazon SageMaker hosting services. The ML team has configured automatic scaling for its SageMaker instances to support workload changes. During testing, the team notices that additional instances are being launched before the new instances are ready. This behavior needs to change as soon as possible.
How can the ML team solve this issue?

Accepted Answer

Correct answer: D. D. Increase the cooldown period for the scale-out activity. — The correct answer is D because increasing the cooldown period for the scale-out activity allows more time for the newly launched instances to become ready before additional instances are created. This prevents overwhelming the system with too many instances that are not yet operational. The other options either suggest incorrect actions or do not address the timing issue related to instance readiness.

AWS Certified Machine Learning – Specialty — Question 136

Answer options

Correct answer: D

Explanation