CompTIA DataX (DY0-001) — Question 81

A data scientist needs to analyze a company's chemical businesses and is using the master database of the conglomerate company. Nothing in the data differentiates the data observations for the different businesses. Which of the following is the most efficient way to identify the chemical businesses' observations?

Answer options

Correct answer: C

Explanation

Option C is the most efficient approach as it involves consulting with the business team to directly identify the relevant sites, thereby saving time and resources by focusing only on necessary data. Options A and B require analyzing all data, which is inefficient without prior knowledge of the relevant observations. Option D suggests ingesting the largest data set, which may contain irrelevant information and does not guarantee pertinent results.