You work for a retail company that is using a regression model built with BigQuery ML to…

Question

You work for a retail company that is using a regression model built with BigQuery ML to predict product sales. This model is being used to serve online predictions. Recently you developed a new version of the model that uses a different architecture (custom model). Initial analysis revealed that both models are performing as expected. You want to deploy the new version of the model to production and monitor the performance over the next two months. You need to minimize the impact to the existing and future model users. How should you deploy the model?

Accepted Answer

Correct answer: A. A. Import the new model to the same Vertex AI Model Registry as a different version of the existing model. Deploy the new model to the same Vertex AI endpoint as the existing model, and use traffic splitting to route 95% of production traffic to the BigQuery ML model and 5% of production traffic to the new model. — The correct answer is A because it allows for gradual testing of the new model with minimal disruption to users by splitting traffic. This approach ensures that the existing model continues to serve the majority of requests while still allowing for the new model to be evaluated in a controlled manner. Options B and C do not adequately utilize traffic splitting, and option D introduces unnecessary complexity with a separate service.

Google Cloud Professional Machine Learning Engineer — Question 230

Answer options

Correct answer: A

Explanation