Google Cloud Professional Machine Learning Engineer — Question 338

You have trained a model on a dataset that required computationally expensive preprocessing operations. You need to execute the same preprocessing at prediction time. You deployed the model on AI Platform for high-throughput online prediction. Which architecture should you use?

Answer options

Correct answer: B

Explanation

Option B is correct because it efficiently utilizes a Dataflow job to transform incoming data in real-time before submitting it to AI Platform for predictions, ensuring that the preprocessing is handled seamlessly. Options A and C suggest creating new architectures that do not address the need for real-time preprocessing at scale. Option D, while it uses a Pub/Sub topic, may not handle high-throughput as effectively as a Dataflow job does.