AWS Certified Data Analytics – Specialty — Question 65
A marketing company is storing its campaign response data in Amazon S3. A consistent set of sources has generated the data for each campaign. The data is saved into Amazon S3 as .csv files. A business analyst will use Amazon Athena to analyze each campaign's data. The company needs the cost of ongoing data analysis with Athena to be minimized.
Which combination of actions should a data analytics specialist take to meet these requirements? (Choose two.)
Answer options
- A. Convert the .csv files to Apache Parquet.
- B. Convert the .csv files to Apache Avro.
- C. Partition the data by campaign.
- D. Partition the data by source.
- E. Compress the .csv files.
Correct answer: A, C
Explanation
Converting .csv files to Apache Parquet (Option A) reduces storage costs and improves query performance due to its columnar format. Additionally, partitioning the data by campaign (Option C) optimizes query efficiency, as Athena can read only the relevant partitions. The other options, while potentially beneficial, do not directly address cost minimization as effectively as these two actions.