AWS Certified Data Analytics – Specialty — Question 163

An airline has been collecting metrics on flight activities for analytics. A recently completed proof of concept demonstrates how the company provides insights to data analysts to improve on-time departures. The proof of concept used objects in Amazon S3, which contained the metrics in .csv format, and used Amazon Athena for querying the data. As the amount of data increases, the data analyst wants to optimize the storage solution to improve query performance.
Which options should the data analyst use to improve performance as the data lake grows? (Choose three.)

Answer options

Correct answer: C, D, F

Explanation

The correct answers are C, D, and F. Compressing the objects reduces the number of bytes Athena has to read from Amazon S3, which lowers data transfer I/O. Keeping the S3 bucket in the same Region as Athena avoids cross-Region latency on every read. Converting the .csv data to Apache Parquet, a columnar format, lets Athena fetch only the data blocks a query needs instead of scanning every row in full. The remaining options, A, B, and E, do not provide comparable performance improvements.
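The effect of options C and F can be illustrated with a small, standard-library-only sketch. It is not Athena or real Parquet: the flight records, the column names (flight_id, origin, delay_minutes), and the helper functions are hypothetical, and the "columnar" layout is a toy stand-in that simply stores each column as its own byte blob. Under those assumptions, it compares how many bytes a query for average delay would need to touch.

```python
import csv
import gzip
import io
import random

COLUMNS = ["flight_id", "origin", "delay_minutes"]  # hypothetical schema


def make_rows(n, seed=0):
    """Generate made-up flight metrics for the illustration."""
    rng = random.Random(seed)
    airports = ["SEA", "SFO", "JFK", "ORD"]
    return [
        (f"FL{i:05d}", rng.choice(airports), rng.randint(0, 120))
        for i in range(n)
    ]


def to_csv_bytes(rows):
    """Row-oriented layout: every row stores all of its columns together."""
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(COLUMNS)
    writer.writerows(rows)
    return buf.getvalue().encode("utf-8")


def to_column_chunks(rows):
    """Toy columnar layout: one byte blob per column (NOT real Parquet)."""
    return {
        name: "\n".join(str(row[i]) for row in rows).encode("utf-8")
        for i, name in enumerate(COLUMNS)
    }


rows = make_rows(10_000)
csv_bytes = to_csv_bytes(rows)
gz_bytes = gzip.compress(csv_bytes)  # option C: compressed objects
columns = to_column_chunks(rows)     # option F: columnar storage (toy version)

# A query for the average delay needs only the delay_minutes column.
row_scan = len(csv_bytes)                  # row layout: the whole file is read
col_scan = len(columns["delay_minutes"])   # columnar layout: one column is read
delays = [int(v) for v in columns["delay_minutes"].decode("utf-8").split("\n")]
avg_delay = sum(delays) / len(delays)

print(f"raw csv bytes:       {len(csv_bytes)}")
print(f"gzip csv bytes:      {len(gz_bytes)}")
print(f"bytes for row scan:  {row_scan}")
print(f"bytes for col scan:  {col_scan}")
print(f"average delay (min): {avg_delay:.1f}")
```

Real Parquet files also compress each column chunk and store statistics that allow whole blocks to be skipped, so the two savings combine in practice. The exact byte counts above depend on the generated data and carry no meaning beyond the comparison.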