Google Cloud Professional Machine Learning Engineer — Question 153
Your organization manages an online message board. A few months ago, you discovered an increase in toxic language and bullying on the message board. You deployed an automated text classifier that flags certain comments as toxic or harmful. Now some users are reporting that benign comments referencing their religion are being misclassified as abusive. Upon further inspection, you find that your classifier's false positive rate is higher for comments that reference certain underrepresented religious groups. Your team has a limited budget and is already overextended. What should you do?
Answer options
- A. Add synthetic training data where those phrases are used in non-toxic ways.
- B. Remove the model and replace it with human moderation.
- C. Replace your model with a different text classifier.
- D. Raise the threshold for comments to be considered toxic or harmful.
Correct answer: A
Explanation
The correct choice is A because adding synthetic training data helps improve the model's performance by teaching it to recognize benign uses of phrases associated with underrepresented religious groups. Option B would not be feasible given the limited budget and might not address the underlying issue. Option C does not guarantee better performance and may incur additional costs, while option D might lead to a higher number of genuinely harmful comments going undetected.