Google Cloud Professional Machine Learning Engineer — Question 68
You need to train a regression model based on a dataset containing 50,000 records that is stored in BigQuery. The data includes a total of 20 categorical and numerical features with a target variable that can include negative values. You need to minimize effort and training time while maximizing model performance. What approach should you take to train this regression model?
Answer options
- A. Create a custom TensorFlow DNN model
- B. Use BQML XGBoost regression to train the model.
- C. Use AutoML Tables to train the model without early stopping.
- D. Use AutoML Tables to train the model with RMSLE as the optimization objective.
Correct answer: B
Explanation
The correct answer is B because BQML XGBoost regression is specifically designed for efficient training and can handle the nuances of regression tasks with large datasets. Options A and C may require more manual tuning and do not optimize performance as effectively, while option D focuses on a specific optimization that may not be necessary for the dataset's characteristics.