You work for an organization that operates a streaming music service. You have a custom p…

Question

You work for an organization that operates a streaming music service. You have a custom production model that is serving a “next song” recommendation based on a user's recent listening history. Your model is deployed on a Vertex AI endpoint. You recently retrained the same model by using fresh data. The model received positive test results offline. You now want to test the new model in production while minimizing complexity. What should you do?

Accepted Answer

Correct answer: C. C. Deploy the new model to the existing Vertex AI endpoint. Use traffic splitting to send 5% of production traffic to the new model. Monitor end-user metrics, such as listening time. If end-user metrics improve between models over time, gradually increase the percentage of production traffic sent to the new model. — The correct answer is C because it allows for a gradual introduction of the new model while monitoring its impact on user metrics in real-time. This minimizes complexity by using the existing endpoint and traffic splitting rather than creating a new endpoint or conducting separate experiments, which can be more cumbersome and time-consuming. Options A and B involve more complexity and do not utilize the immediate feedback from live user data, while option D relies heavily on monitoring without proactively testing the new model's performance in production.

Google Cloud Professional Machine Learning Engineer — Question 255

Answer options

Correct answer: C

Explanation