A company needs to deploy a custom-trained classification ML model on AWS. The model must…

Question

A company needs to deploy a custom-trained classification ML model on AWS. The model must make near real-time predictions with low latency and must handle variable request volumes. Which solution will meet these requirements?

Accepted Answer

Correct answer: C. C. Deploy an Amazon SageMaker AI endpoint. Configure auto scaling for the endpoint. — The correct answer is C because deploying an Amazon SageMaker AI endpoint with auto scaling allows for low latency and the ability to handle variable request volumes in near real-time. Option A is incorrect as batch processing does not meet the near real-time requirement. Option B is not suitable because using an S3 bucket does not provide the necessary low-latency predictions. Option D involves more complexity and does not guarantee the same level of performance as an auto-scaled SageMaker endpoint.

AWS Certified Machine Learning Engineer – Associate (MLA-C01) — Question 171

Answer options

Correct answer: C

Explanation