An ML engineer is training a text generation model on Amazon SageMaker AI. After several…

Question

An ML engineer is training a text generation model on Amazon SageMaker AI. After several epochs, the loss function does not converge, and the model’s accuracy on the validation dataset starts to show oscillating results. The ML engineer needs to ensure that the model achieves generalization. Which solution will meet this requirement?

Accepted Answer

Correct answer: C. C. Decrease the learning rate and increase the mini-batch size. — The correct answer is C because decreasing the learning rate helps stabilize the training process by allowing finer adjustments to the model weights, while increasing the mini-batch size helps provide a more accurate estimate of the gradient, leading to better generalization. The other options either increase the learning rate, which can cause instability, or do not help the model generalize effectively.

AWS Certified Machine Learning Engineer – Associate (MLA-C01) — Question 184

Answer options

Correct answer: C

Explanation