A company has an ML model that is deployed to an Amazon SageMaker endpoint for real-time…

Question

A company has an ML model that is deployed to an Amazon SageMaker endpoint for real-time inference. The company needs to deploy a new model. The company must compare the new model’s performance to the currently deployed model’s performance before shifting all traffic to the new model. Which solution will meet these requirements with the LEAST operational effort?

Accepted Answer

Correct answer: C. C. Deploy the new model as a shadow variant on the same endpoint as the current model. Route a portion of live traffic to the shadow model for evaluation. — The correct answer is C because deploying the new model as a shadow variant allows for real-time traffic evaluation with minimal operational effort. The other options require more complex setups, such as manual traffic splitting or the use of additional services like CloudFront or Lambda, which increase operational overhead.

AWS Certified Machine Learning Engineer – Associate (MLA-C01) — Question 115

Answer options

Correct answer: C

Explanation