Google Cloud Professional Machine Learning Engineer — Question 282
You have developed an AutoML tabular classification model that identifies high-value customers who interact with your organization's website. You plan to deploy the model to a new Vertex AI endpoint that will integrate with your website application. You expect higher traffic to the website during nights and weekends. You need to configure the model endpoint's deployment settings to minimize latency and cost. What should you do?
Answer options
- A. Configure the model deployment settings to use an n1-standard-32 machine type.
- B. Configure the model deployment settings to use an n1-standard-4 machine type. Set the minReplicaCount value to 1 and the maxReplicaCount value to 8.
- C. Configure the model deployment settings to use an n1-standard-4 machine type and a GPU accelerator. Set the minReplicaCount value to 1 and the maxReplicaCount value to 4.
- D. Configure the model deployment settings to use an n1-standard-8 machine type and a GPU accelerator.
Correct answer: B
Explanation
Option B is correct because it balances cost and performance by using an n1-standard-4 machine type with scalable replicas, allowing for efficient handling of increased traffic. Option A proposes a much larger machine type that may incur unnecessary costs. Option C includes a GPU accelerator, which is not needed for a tabular classification model and could lead to higher expenses. Option D suggests a larger machine type with a GPU, which is also excessive in terms of cost for this scenario.