A marketing company receives a large amount of new clickstream data in Amazon S3 from a m…

Question

A marketing company receives a large amount of new clickstream data in Amazon S3 from a marketing campaign. The company needs to analyze the clickstream data in Amazon S3 quickly. Then the company needs to determine whether to process the data further in the data pipeline. Which solution will meet these requirements with the LEAST operational overhead?

Accepted Answer

Correct answer: B. B. Configure an AWS Glue crawler to crawl the data. Configure Amazon Athena to query the data. — Amazon Athena is a serverless, interactive query service that allows for direct querying of Amazon S3 data using standard SQL without needing to manage infrastructure, resulting in the lowest operational overhead. Combining Athena with an AWS Glue crawler simplifies schema discovery and cataloging automatically. Solutions involving Amazon EMR, Spark jobs, or Amazon Kinesis Data Analytics introduce unnecessary architectural complexity and management overhead for simple ad-hoc data analysis.

AWS Certified Solutions Architect – Associate (SAA-C03) — Question 634

Answer options

Correct answer: B

Explanation