AWS Certified Machine Learning – Specialty — Question 44
A retail company intends to use machine learning to categorize new products. A labeled dataset of current products was provided to the Data Science team. The dataset includes 1,200 products. The labeled dataset has 15 features for each product such as title dimensions, weight, and price. Each product is labeled as belonging to one of six categories such as books, games, electronics, and movies.
Which model should be used for categorizing new products using the provided dataset for training?
Answer options
- A. AnXGBoost model where the objective parameter is set to multi:softmax
- B. A deep convolutional neural network (CNN) with a softmax activation function for the last layer
- C. A regression forest where the number of trees is set equal to the number of product categories
- D. A DeepAR forecasting model based on a recurrent neural network (RNN)
Correct answer: A
Explanation
The correct answer is A because XGBoost is well-suited for multi-class classification tasks, particularly with structured data like the provided dataset. Options B and D focus on deep learning approaches that may not be necessary for the structured features available, while C suggests a regression forest, which is not appropriate for a classification problem.