AWS Certified Data Engineer – Associate (DEA-C01) — Question 94
An online retail company has an application that runs on Amazon EC2 instances that are in a VPC. The company wants to collect flow logs for the VPC and analyze network traffic.
Which solution will meet these requirements MOST cost-effectively?
Answer options
- A. Publish flow logs to Amazon CloudWatch Logs. Use Amazon Athena for analytics.
- B. Publish flow logs to Amazon CloudWatch Logs. Use an Amazon OpenSearch Service cluster for analytics.
- C. Publish flow logs to Amazon S3 in text format. Use Amazon Athena for analytics.
- D. Publish flow logs to Amazon S3 in Apache Parquet format. Use Amazon Athena for analytics.
Correct answer: D
Explanation
The correct answer is D because storing flow logs in Apache Parquet format on Amazon S3 is more storage-efficient and optimized for analytical queries when using Amazon Athena. Options A and B involve CloudWatch Logs, which can be more expensive for log storage and analysis, while option C, although using S3, does not take advantage of the efficient Parquet format.