AWS Certified Machine Learning – Specialty — Question 302

An ecommerce company has used Amazon SageMaker to deploy a factorization machines (FM) model to suggest products for customers. The company’s data science team has developed two new models by using the TensorFlow and PyTorch deep learning frameworks. The company needs to use A/B testing to evaluate the new models against the deployed model.

The required A/B testing setup is as follows:

• Send 70% of traffic to the FM model, 15% of traffic to the TensorFlow model, and 15% of traffic to the PyTorch model.
• For customers who are from Europe, send all traffic to the TensorFlow model.

Which architecture can the company use to implement the required A/B testing setup?

Answer options

Correct answer: D

Explanation

Amazon SageMaker supports deploying multiple models as production variants under a single endpoint and distributing traffic among them based on custom weights defined in the endpoint configuration. To override this default weighted routing for specific subsets of traffic, such as European users, the client application can pass the 'TargetVariant' header in the InvokeEndpoint API request to target a specific model variant directly. Introducing load balancers as suggested in options A and C adds unnecessary architectural complexity, while option B incorrectly suggests using auto scaling policies to manage A/B testing traffic split weights.