IBM PureData System for Analytics V7.0 — Question 11

You want to obtain a subset of data from a larger data set, with equally represented subgroups within the subset.
Which node would you use to accomplish this task?

Answer options

Correct answer: D

Explanation

The correct answer is D, Sample node, because it is specifically designed to create subsets of data while maintaining the representation of different subgroups. The other options, such as Analysis node, Partition node, and Ensemble node, either focus on different tasks or do not ensure equal representation of subgroups in the subset.