Designing Azure Infrastructure Solutions — Question 89
You have an Azure subscription that contains an Azure Cosmos DB for NoSQL account named account1 and an Azure Synapse Analytics workspace named Workspace1. The account1 account contains a container named Contained that has the analytical store enabled.
You need to recommend a solution that will process the data stored in Contained in near-real-time (NRT) and output the results to a data warehouse in Workspace1 by using a runtime engine in the workspace. The solution must minimize data movement.
Which pool in Workspace1 should you use?
Answer options
- A. Apache Spark
- B. serverless SQL
- C. dedicated SQL
- D. Data Explorer
Correct answer: A
Explanation
The correct answer is A, Apache Spark, because it is well-suited for processing large volumes of data in near-real-time scenarios, particularly when working with analytical stores. The other options, while useful for different purposes, do not provide the same level of performance and flexibility for real-time data processing in this context.