AWS Certified Data Engineer – Associate (DEA-C01) — Question 240
A gaming company uses AWS Glue to perform read and write operations on Apache Iceberg tables for real-time streaming data. The data in the Iceberg tables is in Apache Parquet format. The company is experiencing slow query performance.
Which solutions will improve query performance? (Choose two.)
Answer options
- A. Use AWS Glue Data Catalog to generate column-level statistics for the Iceberg tables on a schedule.
- B. Use AWS Glue Data Catalog to automatically compact the Iceberg tables.
- C. Use AWS Glue Data Catalog to automatically optimize indexes for the Iceberg tables.
- D. Use AWS Glue Data Catalog to enable copy-on-write for the Iceberg tables.
- E. Use AWS Glue Data Catalog to generate views for the Iceberg tables.
Correct answer: B, D
Explanation
Options B and D are correct because automatically compacting the Iceberg tables helps to reduce the number of files and improves read efficiency, while enabling copy-on-write optimizes data storage and retrieval. Options A, C, and E do not directly address the performance issues related to query execution speed.