AWS Certified Solutions Architect – Associate (SAA-C03) — Question 743

A company has stored 10 TB of log files in Apache Parquet format in an Amazon S3 bucket. The company occasionally needs to use SQL to analyze the log files.

Which solution will meet these requirements MOST cost-effectively?

Answer options

Correct answer: C

Explanation

Amazon Athena is a serverless, interactive query service that runs SQL queries directly against data in Amazon S3 and charges only for the data each query scans, which makes it cost-effective for occasional analysis. Because Apache Parquet is a columnar format, Athena reads only the columns a query references, which further reduces the amount of data scanned. An AWS Glue crawler can automatically discover the schema of the Parquet files and register it in the AWS Glue Data Catalog so that Athena can query them. The alternative options involving Amazon Aurora, Amazon Redshift, and Amazon EMR require provisioning and paying for running database instances or clusters, which makes them significantly more expensive for infrequent use.
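
The cost argument can be made concrete with a rough back-of-the-envelope sketch. Every figure in it is an assumption chosen for illustration and does not come from the question: the per-TB scan price, the hourly cluster price, the number of queries per month, and the fraction of the data each query reads are all hypothetical, and actual AWS prices vary by Region and over time.

```python
# Rough cost comparison: pay-per-scan querying vs. an always-on cluster.
# All prices and usage figures below are illustrative assumptions,
# not quoted AWS rates.

PRICE_PER_TB_SCANNED = 5.00    # assumed USD per TB scanned by a query
CLUSTER_PRICE_PER_HOUR = 1.00  # hypothetical USD per hour for a provisioned cluster
HOURS_PER_MONTH = 730


def athena_monthly_cost(dataset_tb, fraction_scanned, queries_per_month):
    """Monthly cost when each query scans only a fraction of the dataset."""
    tb_per_query = dataset_tb * fraction_scanned
    return tb_per_query * PRICE_PER_TB_SCANNED * queries_per_month


def cluster_monthly_cost():
    """Monthly cost of a cluster billed for every hour, queried or not."""
    return CLUSTER_PRICE_PER_HOUR * HOURS_PER_MONTH


# 10 TB of Parquet logs, 4 queries a month, each reading 10% of the data
# (the query count and the 10% figure are assumptions).
athena = athena_monthly_cost(10, 0.10, 4)
cluster = cluster_monthly_cost()
print(f"Pay-per-scan: ${athena:.2f}/month, always-on cluster: ${cluster:.2f}/month")
```

Under these assumed numbers the pay-per-scan cost grows only with the number of queries and the data they read, while the provisioned cluster costs the same whether or not anyone queries it. That difference is what makes the serverless option attractive for occasional analysis.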