A data scientist at a financial services company used Amazon SageMaker to train and deplo…

Question

A data scientist at a financial services company used Amazon SageMaker to train and deploy a model that predicts loan defaults. The model analyzes new loan applications and predicts the risk of loan default. To train the model, the data scientist manually extracted loan data from a database. The data scientist performed the model training and deployment steps in a Jupyter notebook that is hosted on SageMaker Studio notebooks. The model's prediction accuracy is decreasing over time. Which combination of steps is the MOST operationally efficient way for the data scientist to maintain the model's accuracy? (Choose two.)

Accepted Answer

Correct answer: A, B. A. Use SageMaker Pipelines to create an automated workflow that extracts fresh data, trains the model, and deploys a new version of the model. — B. Configure SageMaker Model Monitor with an accuracy threshold to check for model drift. Initiate an Amazon CloudWatch alarm when the threshold is exceeded. Connect the workflow in SageMaker Pipelines with the CloudWatch alarm to automatically initiate retraining. — Option A is correct because automating the data extraction and model training process with SageMaker Pipelines increases efficiency and ensures timely updates to the model. Option B complements this by monitoring model performance and triggering retraining when necessary, which is essential for maintaining accuracy. Options C and D are less efficient as they either lack automation or rely on manual processes, while option E introduces unnecessary complexity without directly addressing accuracy maintenance.

AWS Certified Machine Learning – Specialty — Question 216

Answer options

Correct answer: A, B

Explanation