AWS Certified Machine Learning Engineer – Associate (MLA-C01) — Question 107
An ML engineer notices class imbalance in an image classification training job.
What should the ML engineer do to resolve this issue?
Answer options
- A. Reduce the size of the dataset.
- B. Transform some of the images in the dataset.
- C. Apply random oversampling on the dataset.
- D. Apply random data splitting on the dataset.
Correct answer: C
Explanation
Applying random oversampling on the dataset helps to balance the class distribution by duplicating instances of the underrepresented classes, which can lead to improved model performance. Reducing the dataset size (A) may worsen the imbalance, while transforming images (B) does not directly address class distribution. Random data splitting (D) does not resolve class imbalance issues either.