You are creating an ML pipeline for data processing, model training, and model deployment…

Question

You are creating an ML pipeline for data processing, model training, and model deployment that uses different Google Cloud services. You have developed code for each individual task, and you expect a high frequency of new files. You now need to create an orchestration layer on top of these tasks. You only want this orchestration pipeline to run if new files are present in your dataset in a Cloud Storage bucket. You also want to minimize the compute node costs. What should you do?

Accepted Answer

Correct answer: C. C. Create a pipeline in Vertex AI Pipelines. Create a Cloud Function that uses a Cloud Storage trigger and deploys the pipeline. — The correct answer is C because it combines the capabilities of Vertex AI Pipelines and Cloud Functions to trigger the workflow based on new files in Cloud Storage, ensuring efficient orchestration. Option A runs the pipeline on a schedule, which doesn't strictly respond to new files, while option B lacks the direct orchestration of the pipeline itself, and option D, although effective, does not utilize Vertex AI Pipelines for model training and deployment.

Google Cloud Professional Machine Learning Engineer — Question 273

Answer options

Correct answer: C

Explanation