ISACA Certified Artificial Intelligence Auditor (CAIA) — Question 14
An IS auditor learns that the training data for an AI solution has insufficient representation. Which of the following would be the GREATEST risk in this situation?
Answer options
- A. Imbalance
- B. Underfitting
- C. Overfitting
- D. Clustering
Correct answer: A
Explanation
The greatest risk of insufficient representation in training data is imbalance, as it can lead to biased models that perform poorly on underrepresented classes. Underfitting and overfitting are related to model complexity rather than data representation, and clustering does not directly relate to the risks posed by the quality of training data.