ISACA Certified Artificial Intelligence Auditor (CAIA) — Question 14

An IS auditor learns that the training data for an AI solution has insufficient representation. Which of the following would be the GREATEST risk in this situation?

Answer options

Correct answer: A

Explanation

The greatest risk of insufficient representation in training data is imbalance, as it can lead to biased models that perform poorly on underrepresented classes. Underfitting and overfitting are related to model complexity rather than data representation, and clustering does not directly relate to the risks posed by the quality of training data.