AWS Certified Machine Learning Engineer – Associate (MLA-C01) — Question 184

An ML engineer is training a text generation model on Amazon SageMaker AI. After several epochs, the loss function does not converge, and the model’s accuracy on the validation dataset starts to show oscillating results.

The ML engineer needs to ensure that the model achieves generalization.

Which solution will meet this requirement?

Answer options

Correct answer: C

Explanation

The correct answer is C because decreasing the learning rate helps stabilize the training process by allowing finer adjustments to the model weights, while increasing the mini-batch size helps provide a more accurate estimate of the gradient, leading to better generalization. The other options either increase the learning rate, which can cause instability, or do not help the model generalize effectively.