AWS Certified Data Analytics – Specialty — Question 7

Once a month, a company receives a 100 MB .csv file compressed with gzip. The file contains 50,000 property listing records and is stored in Amazon S3 Glacier.
The company needs its data analyst to query a subset of the data for a specific vendor.
What is the most cost-effective solution?

Answer options

Correct answer: A

Explanation

The most cost-effective solution is to load the data into Amazon S3 and utilize Amazon S3 Select, as it allows you to query only the required data without the need to retrieve the entire dataset from Glacier. Options B, C, and D involve either higher retrieval costs or more complex querying services that are unnecessary for the task at hand.