Google Cloud Professional Machine Learning Engineer — Question 183

You have created a Vertex AI pipeline that includes two steps. The first step preprocesses 10 TB data completes in about 1 hour, and saves the result in a Cloud Storage bucket. The second step uses the processed data to train a model. You need to update the model’s code to allow you to test different algorithms. You want to reduce pipeline execution time and cost while also minimizing pipeline changes. What should you do?

Answer options

Correct answer: D

Explanation

Enabling caching for the pipeline job allows previously computed results to be reused, significantly reducing execution time and cost for repeated runs, especially in the data preprocessing step. The other options either introduce unnecessary complexity, do not leverage efficient resource utilization, or do not align with the goal of minimizing changes to the pipeline.