AWS Certified Data Engineer – Associate (DEA-C01) — Question 228
A data engineer notices slow query performance on a highly partitioned table that is in Amazon Athena. The table contains daily data for the previous 5 years, partitioned by date.
The data engineer wants to improve query performance and to automate partition management.
Which solution will meet these requirements?
Answer options
- A. Use an AWS Lambda function that runs daily. Configure the function to manually create new partitions in AWS Glue for each day’s data.
- B. Use partition projection in Athena. Configure the table properties by using a date range from 5 years ago to the present.
- C. Reduce the number of partitions by changing the partitioning schema from daily to monthly granularity.
- D. Increase the processing capacity of Athena queries by allocating more compute resources.
Correct answer: B
Explanation
The correct answer is B because partition projection allows Athena to automatically manage partitions without needing to create them manually, which enhances query performance. Option A involves manual partition creation, which does not automate the process. Option C may improve performance but could lead to data management challenges with fewer partitions. Option D increases resources but does not address the underlying partitioning inefficiencies.