Designing and Implementing Enterprise-Scale Analytics Using Microsoft Azure and Power BI — Question 105
You are using a Python notebook in an Apache Spark pool in Azure Synapse Analytics.
You need to present the data distribution statistics from a DataFrame in a tabular view.
Which method should you invoke on the DataFrame?
Answer options
- A. rollup
- B. freqItems
- C. explain
- D. describe
Correct answer: D
Explanation
The correct answer is D. The 'describe' method returns a new DataFrame of summary statistics (count, mean, standard deviation, minimum, and maximum) for the numeric and string columns, which can then be displayed as a table, for example with 'show'. The 'rollup' method creates a multi-dimensional rollup for grouped aggregation, and 'freqItems' finds frequently occurring items in the specified columns; neither summarizes the data distribution. The 'explain' method prints the query execution plan and returns no statistics.