Google Cloud Professional Machine Learning Engineer — Question 142

While running a model training pipeline on Vertex Al, you discover that the evaluation step is failing because of an out-of-memory error. You are currently using TensorFlow Model Analysis (TFMA) with a standard Evaluator TensorFlow Extended (TFX) pipeline component for the evaluation step. You want to stabilize the pipeline without downgrading the evaluation quality while minimizing infrastructure overhead. What should you do?

Answer options

Correct answer: A

Explanation

The correct option, A, suggests using Dataflow to run the evaluation step, which can better handle memory management and scale resources as needed, thereby preventing out-of-memory errors. Option B may require additional management and does not optimize resource usage as effectively as Dataflow. Option C involves more complexity and could introduce overhead rather than reduce it. Option D may limit metrics but does not address the root cause of the memory issue.