AWS Certified Machine Learning – Specialty — Question 296

A data scientist is building a forecasting model for a retail company by using the most recent 5 years of sales records that are stored in a data warehouse. The dataset contains sales records for each of the company’s stores across five commercial regions. The data scientist creates a working dataset with StoreID, Region, Date, and Sales Amount as columns. The data scientist wants to analyze yearly average sales for each region. The data scientist also wants to compare each region’s performance against the average sales across all commercial regions.

Which visualization will help the data scientist better understand the data trend?

Answer options

Correct answer: D

Explanation

To analyze and compare regional performance, the data must be aggregated at the region level rather than the store level, which rules out options A and B. Faceting the bar plot by year lets the data scientist see the yearly trend for all five regions side by side. Adding a horizontal line to each facet that marks the average sales across all regions gives a direct visual baseline for comparing each region against the overall average.
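
The aggregation and plot described in the explanation can be sketched roughly as follows. This is a minimal illustration, not the exam's reference solution: the sample rows, the function names, and the choice to draw the baseline as the mean of all sales records in a given year (rather than, say, the mean of the regional means) are all assumptions made for the example. The plotting function assumes matplotlib is installed and imports it only when called.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical sample rows: (StoreID, Region, Date, SalesAmount).
records = [
    ("S1", "North", "2022-03-01", 100.0),
    ("S2", "North", "2022-09-15", 200.0),
    ("S3", "South", "2022-06-10", 300.0),
    ("S1", "North", "2023-02-20", 400.0),
    ("S3", "South", "2023-07-04", 200.0),
]


def yearly_region_averages(rows):
    """Return {year: {region: mean sales}}, aggregated at the region level."""
    buckets = defaultdict(lambda: defaultdict(list))
    for _store, region, date, amount in rows:
        buckets[int(date[:4])][region].append(amount)
    return {
        year: {region: mean(values) for region, values in regions.items()}
        for year, regions in buckets.items()
    }


def yearly_overall_averages(rows):
    """Return {year: mean sales over all records in that year} (assumed baseline)."""
    buckets = defaultdict(list)
    for _store, _region, date, amount in rows:
        buckets[int(date[:4])].append(amount)
    return {year: mean(values) for year, values in buckets.items()}


def plot_faceted_bars(region_avg, overall_avg):
    """Draw one bar-plot facet per year with a horizontal all-region average line."""
    import matplotlib.pyplot as plt  # imported lazily; assumes matplotlib is available

    years = sorted(region_avg)
    fig, axes = plt.subplots(1, len(years), sharey=True, squeeze=False)
    for ax, year in zip(axes[0], years):
        regions = sorted(region_avg[year])
        ax.bar(regions, [region_avg[year][r] for r in regions])
        ax.axhline(overall_avg[year], color="red", linestyle="--",
                   label="All-region average")
        ax.set_title(str(year))
    axes[0][0].set_ylabel("Average sales amount")
    axes[0][0].legend()
    return fig


region_avg = yearly_region_averages(records)
overall_avg = yearly_overall_averages(records)
```

Calling `plot_faceted_bars(region_avg, overall_avg)` would then produce one facet per year. Sharing the y-axis across facets keeps bar heights comparable from year to year, which is what makes the trend readable.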