IBM PureData System for Analytics V7.0 — Question 12
You have a large amount of data from which you want to build a model. Although many of the records are complete, a substantial number contain missing data. The records containing incomplete information should be excluded from analysis.
Which node will exclude the undesired records?
Answer options
- A. Filler node
- B. Filter node
- C. Select node
- D. Aggregate node
Correct answer: C
Explanation
The correct answer is C, the Select node. It keeps or discards records based on a specified condition, so a condition that tests for missing values excludes the incomplete records. The Filler node replaces field values (for example, filling in missing values) rather than removing records. The Filter node removes or renames fields (columns), not records, so it cannot exclude incomplete records. The Aggregate node summarizes records into grouped values and does not exclude individual records.
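The distinction between these nodes can be illustrated with a small plain-Python analogy. This is only a sketch of the concepts, not the product's API: the sample records, the field names, and the helper functions `select_complete`, `filter_fields`, and `fill_missing` are all hypothetical and were chosen for illustration.

```python
# Hypothetical sample records; None marks a missing value.
records = [
    {"id": 1, "age": 34, "income": 52000},
    {"id": 2, "age": None, "income": 48000},
    {"id": 3, "age": 45, "income": None},
    {"id": 4, "age": 29, "income": 61000},
]


def select_complete(rows):
    """Select-style operation: keep only records with no missing values."""
    return [row for row in rows if all(v is not None for v in row.values())]


def filter_fields(rows, keep):
    """Filter-style operation: drop fields (columns); every record is kept."""
    return [{name: row[name] for name in keep} for row in rows]


def fill_missing(rows, default):
    """Filler-style operation: replace missing values; every record is kept."""
    return [
        {name: (default if value is None else value) for name, value in row.items()}
        for row in rows
    ]


complete = select_complete(records)
print([row["id"] for row in complete])  # prints [1, 4]
```

Only the Select-style operation reduces the number of records. The Filter-style and Filler-style operations both return all four records, with fewer fields or with substituted values respectively, which is why neither satisfies the requirement in the question.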