Designing and Implementing Enterprise-Scale Analytics Using Microsoft Azure and Power BI — Question 109
You have a deployment pipeline for a Power BI workspace. The workspace contains two datasets that use Import storage mode.
A database administrator reports a drastic increase in the number of queries sent from the Power BI service to an Azure SQL database since the creation of the deployment pipeline.
An investigation into the issue identifies the following:
One of the datasets is larger than 1 GB and has a fact table that contains more than 500 million rows.
When publishing dataset changes to development, test, or production pipelines, a refresh is triggered against the entire dataset.
You need to recommend a solution to reduce the size of the queries sent to the database when the dataset changes are published to development, test, or production.
What should you recommend?
Answer options
- A. Enable the large dataset storage format for workspace.
- B. Create a dataset parameter to reduce the fact table row count in the development and test pipelines.
- C. Request the authors of the deployment pipeline datasets to reduce the number of datasets republished during development.
- D. Turn off auto refresh when publishing the dataset changes to the Power BI service.
Correct answer: B
Explanation
The correct answer is B because creating a dataset parameter allows for limiting the number of rows in the fact table during development and testing, which reduces the load on the database. Option A, while useful for managing large datasets, does not directly address the query size issue. Option C does not solve the problem of excessive queries as it focuses on dataset management rather than reducing the queries themselves. Option D would prevent any refresh, potentially leading to data that is not up-to-date, which is not a suitable solution.