AWS Certified Machine Learning – Specialty — Question 350

A car company has dealership locations in multiple cities. The company uses a machine learning (ML) recommendation system to market cars to its customers.

An ML engineer trained the ML recommendation model on a dataset that includes multiple attributes about each car. The dataset includes attributes such as car brand, car type, fuel efficiency, and price.

The ML engineer uses Amazon SageMaker Data Wrangler to analyze and visualize data. The ML engineer needs to identify the distribution of car prices for a specific type of car.

Which type of visualization should the ML engineer use to meet these requirements?

Answer options

Correct answer: D

Explanation

A histogram is the ideal visualization tool in Amazon SageMaker Data Wrangler to analyze the distribution and range of values for a continuous feature like car price. Scatter plots are designed to show the relationship between two variables rather than the distribution of one, while anomaly detection is used specifically to identify outlier data points. Quick models are utilized to evaluate predictive power and generate feature importance scores, not to visualize value distributions.