IBM PureData System for Analytics V7.0 — Question 11
You want to obtain a subset of data from a larger data set, with equally represented subgroups within the subset.
Which node would you use to accomplish this task?
Answer options
- A. Analysis node
- B. Partition node
- C. Ensemble node
- D. Sample node
Correct answer: D
Explanation
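The correct answer is D, Sample node. The Sample node selects a subset of records from a larger data set, and its stratified sampling option draws records independently within each subgroup, so every subgroup can be given equal representation in the subset. The other options serve different purposes: the Analysis node evaluates how accurately a model predicts, the Partition node splits the data into training, testing, and validation subsets for model building, and the Ensemble node combines the predictions of several models. None of them controls how subgroups are represented in a subset.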
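
To make the idea of equally represented subgroups concrete, here is a minimal sketch of stratified sampling in plain Python. It is not how the Sample node is implemented. The function name `equal_stratified_sample`, the `region` field, and the toy data are all hypothetical and chosen only for illustration.

```python
import random
from collections import Counter, defaultdict


def equal_stratified_sample(records, key, per_group, seed=0):
    """Draw the same number of records from every subgroup (stratum).

    Hypothetical helper for illustration; `key` maps a record to its
    subgroup label, `per_group` is the number drawn from each subgroup.
    """
    rng = random.Random(seed)  # fixed seed so the draw is repeatable

    # Group the records by subgroup label.
    groups = defaultdict(list)
    for rec in records:
        groups[key(rec)].append(rec)

    sample = []
    for label in sorted(groups):
        members = groups[label]
        if len(members) < per_group:
            raise ValueError(f"subgroup {label!r} has only {len(members)} records")
        # Sample without replacement inside this subgroup only.
        sample.extend(rng.sample(members, per_group))
    return sample


# Toy data: 180 "north" records and 20 "south" records (unbalanced).
records = [
    {"id": i, "region": "south" if i % 10 == 0 else "north"}
    for i in range(200)
]

subset = equal_stratified_sample(records, key=lambda r: r["region"], per_group=5)
counts = Counter(r["region"] for r in subset)
```

Although the source data is heavily skewed toward one region, `counts` holds the same number of records for each region, which is the property the question asks for. A simple random sample of the same size would tend to mirror the skew of the source data instead.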