Databricks Certified Machine Learning Professional — Question 61
A machine learning engineer is monitoring label values for a production machine learning classification model. The engineer believes that the relative prevalence of the classes is changing in more recent data.
Which tool can the machine learning engineer use to assess their theory?
Answer options
- A. One-way Chi-squared Test
- B. Jensen-Shannon distance
- C. Two-way Chi-squared Test
- D. Kolmogorov-Smirnov (KS) test
Correct answer: A
Explanation
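The One-way Chi-squared Test (a goodness-of-fit test) checks whether the observed frequencies of a single categorical variable differ significantly from expected frequencies. That makes it well suited to assessing a change in class prevalence: the class counts in recent data are the observed frequencies, and the expected frequencies come from the class proportions in earlier data. The other options are a weaker fit here: the Two-way Chi-squared Test assesses the association between two categorical variables, the Kolmogorov-Smirnov (KS) test is designed for continuous numeric distributions, and the Jensen-Shannon distance quantifies how far apart two distributions are but is not itself a significance test.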
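As a rough illustration of the test described in the explanation, the sketch below uses `scipy.stats.chisquare`. The function name `class_prevalence_shift`, the example counts, and the 0.05 significance level are assumptions made for this example and are not part of the original question.

```python
from scipy.stats import chisquare


def class_prevalence_shift(baseline_counts, recent_counts, alpha=0.05):
    """One-way Chi-squared (goodness-of-fit) test on label counts.

    baseline_counts: per-class label counts from the reference period.
    recent_counts:   per-class label counts from recent data, same class order.
    alpha:           significance level (0.05 is an assumed default).
    """
    total_baseline = sum(baseline_counts)
    total_recent = sum(recent_counts)

    # Scale the baseline proportions to the recent sample size so that
    # observed and expected frequencies sum to the same total.
    expected = [total_recent * c / total_baseline for c in baseline_counts]

    statistic, p_value = chisquare(f_obs=recent_counts, f_exp=expected)
    return statistic, p_value, p_value < alpha


# Hypothetical counts for a three-class model.
baseline = [700, 200, 100]  # earlier labels
recent = [120, 50, 30]      # recent labels

statistic, p_value, shifted = class_prevalence_shift(baseline, recent)
print(f"chi2={statistic:.3f}, p={p_value:.4f}, shifted={shifted}")
```

A p-value below the chosen significance level suggests that the recent class proportions differ from the baseline proportions. The test assumes the expected count in each class is reasonably large (a common rule of thumb is at least 5).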