Google Cloud Professional Machine Learning Engineer — Question 60
You work for a large retailer and have been asked to segment your customers by their purchasing habits. The purchase history of all customers has been uploaded to BigQuery. You suspect that there may be several distinct customer segments, but you are unsure how many, and you don’t yet understand the commonalities in their behavior. You want to find the most efficient solution. What should you do?
Answer options
- A. Create a k-means clustering model using BigQuery ML. Allow BigQuery to automatically optimize the number of clusters.
- B. Create a new dataset in Dataprep that references your BigQuery table. Use Dataprep to identify similarities within each column.
- C. Use the Data Labeling Service to label each customer record in BigQuery. Train a model on your labeled data using AutoML Tables. Review the evaluation metrics to understand whether there is an underlying pattern in the data.
- D. Get a list of the customer segments from your company’s Marketing team. Use the Data Labeling Service to label each customer record in BigQuery according to the list. Analyze the distribution of labels in your dataset using Data Studio.
Correct answer: A
Explanation
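A is correct. K-means is an unsupervised clustering algorithm, so it needs no labels and suits a problem where the segments are not yet known. BigQuery ML trains the model with SQL directly on the purchase history already stored in BigQuery, so nothing has to be exported or moved, which makes it the most efficient option. If the number of clusters is not specified, BigQuery ML chooses a default based on the size of the training data. Option B is data preparation: looking for similarities within each column in Dataprep does not group customers into segments. Option C is supervised: labeling every record and training with AutoML Tables presupposes the very segments you are trying to discover. Option D relies on predefined segments from the Marketing team, which may not reveal new insights into customer behavior, and Data Studio would only visualize the distribution of those labels.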
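As a rough illustration of option A, the sketch below builds the kind of `CREATE MODEL` statement BigQuery ML accepts for k-means. The dataset, table, and column names (`retail.customer_segments`, `retail.purchase_history`, `total_spend`, `order_count`, `avg_basket_size`) are hypothetical and not taken from the question. `num_clusters` is left out on purpose so that BigQuery ML falls back to its own default.

```python
from typing import Sequence


def build_kmeans_sql(model: str, source_table: str, feature_columns: Sequence[str]) -> str:
    """Return a BigQuery ML CREATE MODEL statement for a k-means model.

    num_clusters is deliberately omitted, so BigQuery ML chooses the
    number of clusters itself.
    """
    features = ",\n  ".join(feature_columns)
    return (
        f"CREATE OR REPLACE MODEL `{model}`\n"
        "OPTIONS (model_type = 'KMEANS') AS\n"
        f"SELECT\n  {features}\nFROM `{source_table}`"
    )


# Hypothetical names, for illustration only.
sql = build_kmeans_sql(
    model="retail.customer_segments",
    source_table="retail.purchase_history",
    feature_columns=["total_spend", "order_count", "avg_basket_size"],
)
print(sql)
```

The resulting statement could be pasted into the BigQuery console or submitted with the `google-cloud-bigquery` client via `Client().query(sql).result()`. After training, `ML.PREDICT` assigns each customer a `centroid_id`, and `ML.CENTROIDS` shows the feature values that characterize each segment.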