A company is building a real-time data processing pipeline for an ecommerce application.…

Question

A company is building a real-time data processing pipeline for an ecommerce application. The application generates a high volume of clickstream data that must be ingested, processed, and visualized in near real time. The company needs a solution that supports SQL for data processing and Jupyter notebooks for interactive analysis. Which solution will meet these requirements?

Accepted Answer

Correct answer: D. D. Use Amazon Managed Streaming for Apache Kafka (Amazon MSK) to ingest the data. Use Amazon Managed Service for Apache Flink to process the data. Use the built-in Flink dashboard to visualize the data. — Option D is correct because Amazon Managed Service for Apache Flink supports real-time processing and visualization, which aligns with the company's needs for near real-time interaction. The other options either do not support the required SQL processing or do not provide an adequate visualization method that meets the near real-time requirement.

AWS Certified Machine Learning Engineer – Associate (MLA-C01) — Question 102

Answer options

Correct answer: D

Explanation