AWS Certified Machine Learning Engineer – Associate (MLA-C01) — Question 55

An ML engineer receives datasets that contain missing values, duplicates, and extreme outliers. The ML engineer must consolidate these datasets into a single data frame and must prepare the data for ML.
Which solution will meet these requirements?

Answer options

Correct answer: A

Explanation

The correct answer is A because Amazon SageMaker Data Wrangler is specifically designed to handle data preparation tasks, including cleansing and enrichment, which is essential for addressing missing values, duplicates, and outliers. The other options do not provide the necessary functionalities for data cleansing and preparation, making them less suitable for the task at hand.