An ML engineer uses one ML framework to train multiple ML models. The ML engineer needs t…

Question

An ML engineer uses one ML framework to train multiple ML models. The ML engineer needs to optimize the inference costs and host the models on Amazon SageMaker AI. Which solution will meet these requirements MOST cost-effectively?

Accepted Answer

Correct answer: B. Create a multi-model inference endpoint for all the models.

A multi-model endpoint hosts many models behind a single endpoint and a shared serving container, so the models share the same instances instead of each paying for its own. Because every model here was trained with the same ML framework, they can all run in that one container, which is the case multi-model endpoints are designed for. Options A and C involve multi-container endpoints, which are aimed at models that need different containers and add overhead this scenario does not require. Option D would provision separate resources for each model, leading to higher costs.
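As a rough illustration of answer B, the sketch below shows how a multi-model endpoint might be set up and called with boto3. It is a minimal sketch under stated assumptions, not a reference deployment: the helper names, the instance type, and every argument value (image URI, S3 prefix, role ARN, endpoint name, artifact name) are hypothetical placeholders. The parts that matter are `Mode="MultiModel"` on the container, a `ModelDataUrl` that points to an S3 prefix holding all the model archives, and `TargetModel` on each invocation.

```python
def build_multi_model_container(image_uri: str, model_prefix_s3: str) -> dict:
    """Container definition for a multi-model endpoint.

    model_prefix_s3 is an S3 prefix (not a single archive) that holds
    every model artifact, e.g. one .tar.gz per model.
    """
    return {
        "Image": image_uri,               # one framework container shared by all models
        "Mode": "MultiModel",             # marks the container as multi-model
        "ModelDataUrl": model_prefix_s3,  # prefix containing all model archives
    }


def build_invoke_args(endpoint_name: str, target_model: str, payload: bytes,
                      content_type: str = "text/csv") -> dict:
    """Arguments for invoke_endpoint; TargetModel selects which model to run."""
    return {
        "EndpointName": endpoint_name,
        "TargetModel": target_model,  # artifact path relative to the S3 prefix
        "ContentType": content_type,
        "Body": payload,
    }


def deploy_and_invoke(name: str, role_arn: str, image_uri: str,
                      model_prefix_s3: str, target_model: str,
                      payload: bytes) -> bytes:
    """Create one endpoint for all models, then call one of them.

    Requires boto3 and AWS credentials; the instance type is an assumption.
    """
    import boto3  # imported here so the builders above work without boto3

    sm = boto3.client("sagemaker")
    sm.create_model(
        ModelName=name,
        ExecutionRoleArn=role_arn,
        PrimaryContainer=build_multi_model_container(image_uri, model_prefix_s3),
    )
    sm.create_endpoint_config(
        EndpointConfigName=name,
        ProductionVariants=[{
            "VariantName": "AllTraffic",
            "ModelName": name,
            "InitialInstanceCount": 1,
            "InstanceType": "ml.m5.xlarge",  # hypothetical choice
        }],
    )
    sm.create_endpoint(EndpointName=name, EndpointConfigName=name)
    # Wait until the endpoint is InService before sending requests.
    sm.get_waiter("endpoint_in_service").wait(EndpointName=name)

    runtime = boto3.client("sagemaker-runtime")
    response = runtime.invoke_endpoint(
        **build_invoke_args(name, target_model, payload)
    )
    return response["Body"].read()
```

In this setup, adding another model means uploading its archive under the same S3 prefix and passing its file name as `TargetModel`; no extra endpoint or instance is created per model, which is where the cost saving comes from.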

AWS Certified Machine Learning Engineer – Associate (MLA-C01) — Question 153
