Your organization manages an online message board. A few months ago, you discovered an in…

Question

Your organization manages an online message board. A few months ago, you discovered an increase in toxic language and bullying on the message board. You deployed an automated text classifier that flags certain comments as toxic or harmful. Now some users are reporting that benign comments referencing their religion are being misclassified as abusive. Upon further inspection, you find that your classifier's false positive rate is higher for comments that reference certain underrepresented religious groups. Your team has a limited budget and is already overextended. What should you do?

Accepted Answer

Correct answer: A. A. Add synthetic training data where those phrases are used in non-toxic ways. — The correct choice is A because adding synthetic training data helps improve the model's performance by teaching it to recognize benign uses of phrases associated with underrepresented religious groups. Option B would not be feasible given the limited budget and might not address the underlying issue. Option C does not guarantee better performance and may incur additional costs, while option D might lead to a higher number of genuinely harmful comments going undetected.

Google Cloud Professional Machine Learning Engineer — Question 153

Answer options

Correct answer: A

Explanation