Google Cloud Professional Machine Learning Engineer — Question 187
You are deploying a new version of a model to a production Vertex AI endpoint that is serving traffic. You plan to direct all user traffic to the new model. You need to deploy the model with minimal disruption to your application. What should you do?
Answer options
- A.
  1. Create a new endpoint.
  2. Create a new model. Set it as the default version. Upload the model to Vertex AI Model Registry.
  3. Deploy the new model to the new endpoint.
  4. Update Cloud DNS to point to the new endpoint.
- B.
  1. Create a new endpoint.
  2. Create a new model. Set the parentModel parameter to the model ID of the currently deployed model and set it as the default version. Upload the model to Vertex AI Model Registry.
  3. Deploy the new model to the new endpoint, and set the new model to 100% of the traffic.
- C.
  1. Create a new model. Set the parentModel parameter to the model ID of the currently deployed model. Upload the model to Vertex AI Model Registry.
  2. Deploy the new model to the existing endpoint, and set the new model to 100% of the traffic.
- D.
  1. Create a new model. Set it as the default version. Upload the model to Vertex AI Model Registry.
  2. Deploy the new model to the existing endpoint.
Correct answer: C
Explanation
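The correct answer is C. Deploying to the existing endpoint keeps the endpoint ID and URL that the application already calls, so no client changes are needed, and setting the new model's traffic split to 100% moves all user traffic to it in one step. Setting the parentModel parameter to the ID of the currently deployed model uploads the new model as a new version of that model in Vertex AI Model Registry rather than as an unrelated model, which keeps the version history together. Options A and B create a new endpoint, which forces the application (or DNS) to be repointed and adds avoidable disruption. Option D deploys to the existing endpoint but never sets the traffic split, so the new model is not guaranteed to receive all traffic, and without parentModel it is registered as a separate model instead of a new version.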
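The two steps in option C can be sketched with the Vertex AI SDK for Python (google-cloud-aiplatform). This is a minimal illustration, not a reference implementation: the helper names upload_kwargs, deploy_kwargs, and roll_out are hypothetical, and the project, location, endpoint ID, artifact URI, container image, and machine type are placeholders the caller must supply. The SDK is imported inside roll_out so the helpers can be inspected without cloud credentials.

```python
from typing import Any, Dict


def upload_kwargs(parent_model: str, artifact_uri: str, container_image: str) -> Dict[str, Any]:
    """Arguments for aiplatform.Model.upload.

    Passing parent_model registers the upload as a new version of the
    existing model instead of creating a separate model.
    """
    return {
        "parent_model": parent_model,
        "artifact_uri": artifact_uri,
        "serving_container_image_uri": container_image,
    }


def deploy_kwargs(machine_type: str) -> Dict[str, Any]:
    """Arguments for Endpoint.deploy that send all traffic to the new model."""
    return {"traffic_percentage": 100, "machine_type": machine_type}


def roll_out(project: str, location: str, endpoint_id: str, parent_model: str,
             artifact_uri: str, container_image: str,
             machine_type: str = "n1-standard-2"):
    """Upload a new model version and deploy it to the EXISTING endpoint."""
    # Imported lazily; requires `pip install google-cloud-aiplatform`
    # and application default credentials.
    from google.cloud import aiplatform

    aiplatform.init(project=project, location=location)

    # Step 1: upload as a new version of the currently deployed model.
    model = aiplatform.Model.upload(
        **upload_kwargs(parent_model, artifact_uri, container_image)
    )

    # Step 2: deploy to the existing endpoint with 100% of the traffic.
    endpoint = aiplatform.Endpoint(endpoint_id)
    endpoint.deploy(model=model, **deploy_kwargs(machine_type))
    return endpoint
```

Because the endpoint object is reused, callers keep sending prediction requests to the same endpoint. The previously deployed model typically stays on the endpoint with no traffic until it is explicitly undeployed, which leaves room for a quick rollback.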