While running a model training pipeline on Vertex Al, you discover that the evaluation st…

Question

While running a model training pipeline on Vertex Al, you discover that the evaluation step is failing because of an out-of-memory error. You are currently using TensorFlow Model Analysis (TFMA) with a standard Evaluator TensorFlow Extended (TFX) pipeline component for the evaluation step. You want to stabilize the pipeline without downgrading the evaluation quality while minimizing infrastructure overhead. What should you do?

Accepted Answer

Correct answer: A. A. Include the flag -runner=DataflowRunner in beam_pipeline_args to run the evaluation step on Dataflow. — The correct option, A, suggests using Dataflow to run the evaluation step, which can better handle memory management and scale resources as needed, thereby preventing out-of-memory errors. Option B may require additional management and does not optimize resource usage as effectively as Dataflow. Option C involves more complexity and could introduce overhead rather than reduce it. Option D may limit metrics but does not address the root cause of the memory issue.

Google Cloud Professional Machine Learning Engineer — Question 142

Answer options

Correct answer: A

Explanation