Data Engineering on Microsoft Azure — Question 25
You are planning a solution to aggregate streaming data that originates in Apache Kafka and is output to Azure Data Lake Storage Gen2. The developers who will implement the stream processing solution use Java.
Which service should you recommend using to process the streaming data?
Answer options
- A. Azure Event Hubs
- B. Azure Data Factory
- C. Azure Stream Analytics
- D. Azure Databricks
Correct answer: D
Explanation
Azure Databricks is the most suitable choice for processing streaming data with Java, as it provides a collaborative environment for big data analytics and supports Spark Streaming. Azure Event Hubs is focused on event ingestion, Azure Data Factory is primarily for data orchestration, and Azure Stream Analytics is better suited for SQL-like queries on streaming data rather than Java-based processing.