Designing and Implementing Enterprise-Scale Analytics Using Microsoft Azure and Power BI — Question 106

You are using a Python notebook in an Apache Spark pool in Azure Synapse Analytics.
You need to present the data distribution statistics from a DataFrame in a tabular view.
Which method should you invoke on the DataFrame?

Answer options

Correct answer: D

Explanation

The correct method to use for showing data distribution statistics in a DataFrame is 'describe', as it provides summary statistics for numerical columns. 'rollup' is used for aggregating data, 'cov' computes the covariance between two columns, and 'explain' is intended for understanding the execution plan of a DataFrame operation, rather than providing statistical summaries.