AWS Certified Machine Learning – Specialty — Question 216

A data scientist at a financial services company used Amazon SageMaker to train and deploy a model that predicts loan defaults. The model analyzes new loan applications and predicts the risk of loan default. To train the model, the data scientist manually extracted loan data from a database. The data scientist performed the model training and deployment steps in a Jupyter notebook that is hosted on SageMaker Studio notebooks. The model's prediction accuracy is decreasing over time.

Which combination of steps is the MOST operationally efficient way for the data scientist to maintain the model's accuracy? (Choose two.)

Answer options

Correct answer: A, B

Explanation

Option A is correct because automating the data extraction and model training process with SageMaker Pipelines increases efficiency and ensures timely updates to the model. Option B complements this by monitoring model performance and triggering retraining when necessary, which is essential for maintaining accuracy. Options C and D are less efficient as they either lack automation or rely on manual processes, while option E introduces unnecessary complexity without directly addressing accuracy maintenance.