AWS Certified AI Practitioner (AIF-C01) — Question 204
A retail company wants to build an ML model to recommend products to customers. The company wants to build the model based on responsible practices.
Which practice should the company apply when collecting data to decrease model bias?
Answer options
- A. Use data from only customers who match the demographics of the company's overall customer base.
- B. Collect data from customers who have a past purchase history.
- C. Ensure that the data is balanced and collected from a diverse group.
- D. Ensure that the data is from a publicly available dataset.
Correct answer: C
Explanation
The correct answer, C, emphasizes the importance of having a balanced dataset that includes diverse demographics, which helps reduce bias in the model. Option A restricts data collection to a narrow demographic, which can increase bias. Option B limits data to only those with a purchase history, potentially excluding valuable insights from new customers. Option D does not guarantee diversity or balance, as publicly available datasets may not represent the company's customer base accurately.