AWS Certified Machine Learning – Specialty — Question 257
A machine learning (ML) specialist uploads 5 TB of data to an Amazon SageMaker Studio environment. The ML specialist performs initial data cleansing. Before the ML specialist begins to train a model, the ML specialist needs to create and view an analysis report that details potential bias in the uploaded data.
Which combination of actions will meet these requirements with the LEAST operational overhead? (Choose two.)
Answer options
- A. Use SageMaker Clarify to automatically detect data bias
- B. Turn on the bias detection option in SageMaker Ground Truth to automatically analyze data features.
- C. Use SageMaker Model Monitor to generate a bias drift report.
- D. Configure SageMaker Data Wrangler to generate a bias report.
- E. Use SageMaker Experiments to perform a data check
Correct answer: A, D
Explanation
The correct actions are A and D. SageMaker Clarify is specifically designed to detect bias in datasets, providing an automated solution, while SageMaker Data Wrangler simplifies data preparation and can generate bias reports effectively. The other options either do not focus on bias detection or involve more complexity and operational overhead.