Designing and Implementing Enterprise-Scale Analytics Using Microsoft Azure and Power BI — Question 107

You are using a Python notebook in an Apache Spark pool in Azure Synapse Analytics.
You need to present the data distribution statistics from a Data Frame in a tabular view.
Which method should you invoke on the Data Frame?

Answer options

Correct answer: B

Explanation

The correct method to use is 'describe', as it provides summary statistics of the Data Frame, including count, mean, and standard deviation. The 'sample' method is used to retrieve a random sample of the data, 'freqItems' is for finding frequent items in a column, and 'explain' is for displaying the logical plan of the Data Frame operations.