A company has an ecommerce website with a product recommendation engine built in TensorFl…

Question

A company has an ecommerce website with a product recommendation engine built in TensorFlow. The recommendation engine endpoint is hosted by Amazon
SageMaker. Three compute-optimized instances support the expected peak load of the website.
Response times on the product recommendation page are increasing at the beginning of each month. Some users are encountering errors. The website receives the majority of its traffic between 8 AM and 6 PM on weekdays in a single time zone.
Which of the following options are the MOST effective in solving the issue while keeping costs to a minimum? (Choose two.)

Accepted Answer

Correct answer: A, C. A. Configure the endpoint to use Amazon Elastic Inference (EI) accelerators. — C. Configure the endpoint to automatically scale with the InvocationsPerInstance metric. — Choosing options A and C provides cost-effective solutions to enhance performance. Option A introduces Amazon Elastic Inference, which accelerates TensorFlow model inference without significant cost increases. Option C allows for dynamic scaling based on traffic, ensuring resources are utilized efficiently during peak times. The other options either do not directly address the performance issues or may incur higher costs without guaranteeing improved response times.

AWS Certified Machine Learning – Specialty — Question 165

Answer options

Correct answer: A, C

Explanation