SAS Statistical Business Analysis Using SAS 9: Regression and Modeling — Question 19
An analyst has a sufficient volume of data to perform a 3-way partition of the data into training, validation, and test sets to perform honest assessment during the model building process.
What is the purpose of the training data set?
Answer options
- A. To provide an unbiased measure of assessment for the final model.
- B. To compare models and select and fine-tune the final model.
- C. To reduce total sample size to make computations more efficient.
- D. To build the predictive models.
Correct answer: A
Explanation
The correct answer is D, as the training dataset is specifically used to build the predictive models by enabling the algorithm to learn from the data. Option A refers to the validation or test datasets, which are used for assessing the model's performance, while options B and C do not accurately describe the purpose of the training data.