Designing and Implementing Enterprise-Scale Analytics Using Microsoft Azure and Power BI — Question 79
You are using a Python notebook in an Apache Spark pool in Azure Synapse Analytics.
You need to present the data distribution statistics from a DataFrame in a tabular view.
Which method should you invoke on the DataFrame?
Answer options
- A. freqItems
- B. corr
- C. summary
- D. rollup
Correct answer: C
Explanation
The correct answer is C, as the summary method provides a comprehensive overview of the statistics of a DataFrame in a tabular format. The other options do not serve the purpose of displaying data distribution statistics; freqItems is for frequent item sets, corr calculates correlation, and rollup is used for aggregating data.