AWS Certified Data Engineer – Associate (DEA-C01) — Question 48

An airline company is collecting metrics about flight activities for analytics. The company is conducting a proof of concept (POC) test to show how analytics can provide insights that the company can use to increase on-time departures.
The POC test uses objects in Amazon S3 that contain the metrics in .csv format. The POC test uses Amazon Athena to query the data. The data is partitioned in the S3 bucket by date.
As the amount of data increases, the company wants to optimize the storage solution to improve query performance.
Which combination of solutions will meet these requirements? (Choose two.)

Answer options

Correct answer: C, E

Explanation

C is correct because keeping the S3 bucket in the same AWS Region where Athena runs avoids cross-Region data transfer, which lowers query latency. E is correct because Apache Parquet is a columnar format: Athena reads only the columns a query references and can skip blocks of data that cannot satisfy the query's predicates, so it scans less data than it would with row-oriented .csv files. Options A, B, and D do not improve query performance to the same degree as C and E.
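
One way to do the conversion described in E is an Athena CREATE TABLE AS SELECT (CTAS) statement that rewrites the .csv-backed table as Parquet while keeping the date partitioning. The sketch below only builds the SQL text. The table names, column names, and bucket path are hypothetical placeholders, not values from the question, and the helper `build_parquet_ctas` is illustrative rather than part of any AWS SDK.

```python
def build_parquet_ctas(source_table, target_table, target_location,
                       columns, partition_column):
    """Return an Athena CTAS statement that rewrites a table as Parquet.

    All arguments are hypothetical names supplied by the caller.
    """
    # Athena CTAS expects partition columns to come last in the SELECT list.
    select_list = ", ".join(list(columns) + [partition_column])
    return (
        f"CREATE TABLE {target_table}\n"
        f"WITH (\n"
        f"  format = 'PARQUET',\n"
        f"  external_location = '{target_location}',\n"
        f"  partitioned_by = ARRAY['{partition_column}']\n"
        f")\n"
        f"AS SELECT {select_list}\n"
        f"FROM {source_table}"
    )


# Example with placeholder names (assumptions, not from the question).
ctas_sql = build_parquet_ctas(
    source_table="flight_metrics_csv",
    target_table="flight_metrics_parquet",
    target_location="s3://example-bucket/flight-metrics-parquet/",
    columns=["flight_id", "scheduled_departure", "actual_departure"],
    partition_column="flight_date",
)
print(ctas_sql)
```

The generated statement can be run in the Athena console or submitted through an API client. Placing the target location in a bucket in the same Region as the Athena queries keeps the result consistent with C as well.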