You are using a Python notebook in an Apache Spark pool in Azure Synapse Analytics. You n…

Question

You are using a Python notebook in an Apache Spark pool in Azure Synapse Analytics.
You need to present the data distribution statistics from a DataFrame in a tabular view.
Which method should you invoke on the DataFrame?

Accepted Answer

Correct answer: D. D. describe — The correct method to use for showing data distribution statistics in a DataFrame is 'describe', as it provides summary statistics for numerical columns. 'rollup' is used for aggregating data, 'cov' computes the covariance between two columns, and 'explain' is intended for understanding the execution plan of a DataFrame operation, rather than providing statistical summaries.

Designing and Implementing Enterprise-Scale Analytics Using Microsoft Azure and Power BI — Question 106

Answer options

Correct answer: D

Explanation