AWS Certified Machine Learning – Specialty — Question 350
A car company has dealership locations in multiple cities. The company uses a machine learning (ML) recommendation system to market cars to its customers.
An ML engineer trained the ML recommendation model on a dataset that includes multiple attributes about each car. The dataset includes attributes such as car brand, car type, fuel efficiency, and price.
The ML engineer uses Amazon SageMaker Data Wrangler to analyze and visualize data. The ML engineer needs to identify the distribution of car prices for a specific type of car.
Which type of visualization should the ML engineer use to meet these requirements?
Answer options
- A. Use the SageMaker Data Wrangler scatter plot visualization to inspect the relationship between the car price and type of car.
- B. Use the SageMaker Data Wrangler quick model visualization to quickly evaluate the data and produce importance scores for the car price and type of car.
- C. Use the SageMaker Data Wrangler anomaly detection visualization to Identify outliers for the specific features.
- D. Use the SageMaker Data Wrangler histogram visualization to inspect the range of values for the specific feature.
Correct answer: D
Explanation
A histogram is the ideal visualization tool in Amazon SageMaker Data Wrangler to analyze the distribution and range of values for a continuous feature like car price. Scatter plots are designed to show the relationship between two variables rather than the distribution of one, while anomaly detection is used specifically to identify outlier data points. Quick models are utilized to evaluate predictive power and generate feature importance scores, not to visualize value distributions.