Google Cloud Professional Machine Learning Engineer — Question 218

Your company manages an ecommerce website. You developed an ML model that recommends additional products to users in near real time based on items currently in the user’s cart. The workflow will include the following processes:

1. The website will send a Pub/Sub message with the relevant data and then receive a message with the prediction from Pub/Sub
2. Predictions will be stored in BigQuery
3. The model will be stored in a Cloud Storage bucket and will be updated frequently

You want to minimize prediction latency and the effort required to update the model. How should you reconfigure the architecture?

Answer options

Correct answer: D

Explanation

The correct answer, D, allows for efficient serving of predictions by using the RunInference API, which can monitor changes in the model and minimize latency. Option A does not effectively handle model updates, while B introduces unnecessary complexity with a pipeline. Option C, while functional, may not be as efficient as directly using the RunInference API for real-time predictions.