Designing and Implementing Enterprise-Scale Analytics Using Microsoft Azure and Power BI — Question 108
You have a deployment pipeline for a Power BI workspace. The workspace contains two datasets that use import storage mode.
A database administrator reports a drastic increase in the number of queries sent from the Power Bi service to an Azure SQL database since the creation of the deployment pipeline.
An investigation into the issue identifies the following:
One of the datasets is larger than 1 GB and has a fact table that contains more than 500 million rows.
When publishing dataset changes to development, test, or production pipelines, a refresh is triggered against the entire dataset.
You need to recommend a solution to reduce the size of the queries sent to the database when the dataset changes are published to development, test, or production.
What should you recommend?
Answer options
- A. Request the authors of the deployment pipeline datasets to reduce the number of datasets republished during development.
- B. In the dataset, delete the fact table.
- C. Configure the dataset to use a composite model that has a DirectQuery connection to the fact table.
- D. From Capacity settings in the Power Bi Admin portal, reduce the Max Intermediate Row Set Count setting.
Correct answer: C
Explanation
The correct answer is C because using a composite model with a DirectQuery connection allows for more efficient querying, reducing the load on the database when datasets are published. Option A does not address the issue of the large dataset itself, while option B would remove critical data needed for analysis. Option D may help with performance but does not directly tackle the root cause of the excessive queries triggered by dataset refreshes.