AWS Certified Machine Learning – Specialty — Question 356

A machine learning (ML) specialist collected daily product usage data for a group of customers. The ML specialist appended customer metadata such as age and gender from an external data source.

The ML specialist wants to understand product usage patterns for each day of the week for customers in specific age groups. The ML specialist creates two categorical features named dayofweek and binned_age, respectively.

Which approach should the ML specialist use discover the relationship between the two new categorical features?

Answer options

Correct answer: B

Explanation

Crosstabs (cross-tabulations) are the standard statistical tool for analyzing the relationship between two categorical variables, as they display the frequency distribution of the variables in a matrix. Scatterplots and boxplots require at least one continuous numerical variable to be effective, while word clouds are designed for visualizing textual data frequency rather than categorical relationships.