Designing and Implementing a Data Science Solution on Azure — Question 1
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You are analyzing a numerical dataset which contains missing values in several columns.
You must clean the missing values using an appropriate operation without affecting the dimensionality of the feature set.
You need to analyze a full dataset to include all values.
Solution: Calculate the column median value and use the median value as the replacement for any missing value in the column.
Does the solution meet the goal?
Answer options
- A. Yes
- B. No
Correct answer: A
Explanation
The solution is correct because replacing missing values with the column median does not alter the dimensionality of the feature set, allowing for a complete analysis of the dataset. Option B is incorrect as it suggests that this method would not meet the goal, which it does by ensuring all data points are accounted for without changing the dataset's structure.