IBM PureData System for Analytics V7.0 — Question 2
What is the primary purpose of the Partition node in a modeling effort?
Answer options
- A. Divide the data into training and testing data.
- B. Increase the proportion of under-represented subgroups within the data.
- C. Decrease the proportion of an over-represented subgroup within the data.
- D. Allow for equal sampling across subgroups within the data.
Correct answer: A
Explanation
The correct answer is A. The Partition node splits the data into separate subsets, typically training and testing (and optionally validation), so that a model can be built on one subset and evaluated on records it has not seen. Options B and C describe rebalancing, which raises or lowers the proportion of particular subgroups and is handled by a Balance node rather than the Partition node. Option D describes stratified sampling, which is a sampling operation and not the purpose of partitioning.
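To make the idea of partitioning concrete, here is a minimal sketch in plain Python. It is not how any IBM product implements the Partition node; the function name `partition`, the 70/30 split, and the seed are assumptions chosen for illustration. Each record is randomly tagged as training or testing, and the tags are then used to separate the two subsets.

```python
import random


def partition(records, train_fraction=0.5, seed=42):
    """Tag each record as 'training' or 'testing' at random.

    Hypothetical helper for illustration only; a fixed seed makes
    the assignment reproducible across runs.
    """
    rng = random.Random(seed)
    tagged = []
    for record in records:
        role = "training" if rng.random() < train_fraction else "testing"
        tagged.append((record, role))
    return tagged


# Example: split 1000 record IDs roughly 70/30.
records = list(range(1000))
tagged = partition(records, train_fraction=0.7)

train = [r for r, role in tagged if role == "training"]
test = [r for r, role in tagged if role == "testing"]

print(len(train), len(test))
```

The split changes which records each modeling step sees, not the proportions of subgroups within the data, which is why the rebalancing options do not describe partitioning.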