Designing an Azure Data Solution (legacy) — Question 40
You are planning an Azure solution that will aggregate streaming data.
The input data will be retrieved from tab-separated values (TSV) files in Azure Blob storage.
You need to output the maximum value from a specific column for every two-minute period in near real-time. The output must be written to Blob storage as a
Parquet file.
What should you use?
Answer options
- A. Azure Data Factory and mapping data flows
- B. Azure Data Factory and wrangling data flows
- C. Azure Stream Analytics window functions
- D. Azure Databricks and Apache Spark SQL window functions
Correct answer: C
Explanation
The correct answer is C, Azure Stream Analytics window functions, as it is specifically designed for real-time data processing and can efficiently compute windowed aggregations like maximum values. The other options, while useful for data processing, are not optimized for real-time analytics on streaming data and would not meet the requirement for immediate output.