SAS Statistical Business Analysis Using SAS 9: Regression and Modeling — Question 14
While building a predictive model, median imputations are performed while preparing the training data.
How should the imputations be addressed in the validation data?
Answer options
- A. The imputed values are irrelevant to the validation data, and are not used.
- B. The imputed values must be applied directly to the validation data without recalculation.
- C. The imputed values must be recalculated using the validation data.
- D. The imputed values must be recalculated using both the training and the validation data.
Correct answer: C
Explanation
The correct approach is to recalculate the imputed values using the validation data (option C) to ensure that the model's performance is evaluated accurately based on the characteristics of that data. Options A and B are incorrect as they disregard the necessary adjustments needed for validation data. Option D is also incorrect as it suggests using both datasets for recalculation, which can lead to data leakage.