Google Cloud Professional Machine Learning Engineer — Question 70
You work for a global footwear retailer and need to predict when an item will be out of stock based on historical inventory data Customer behavior is highly dynamic since footwear demand is influenced by many different factors. You want to serve models that are trained on all available data, but track your performance on specific subsets of data before pushing to production. What is the most streamlined and reliable way to perform this validation?
Answer options
- A. Use then TFX ModelValidator tools to specify performance metrics for production readiness.
- B. Use k-fold cross-validation as a validation strategy to ensure that your model is ready for production.
- C. Use the last relevant week of data as a validation set to ensure that your model is performing accurately on current data.
- D. Use the entire dataset and treat the area under the receiver operating characteristics curve (AUC ROC) as the main metric.
Correct answer: A
Explanation
The correct answer is A because TFX ModelValidator tools are specifically designed to set performance metrics to assess if the model is production-ready, allowing for focused validation on data subsets. Option B, while a valid technique, is not as streamlined as using dedicated tools like TFX. Option C restricts validation to a narrow time frame, which may not represent overall performance, and Option D does not focus on specific subsets, missing the targeted validation aspect.