AWS Certified Data Engineer – Associate (DEA-C01) — Question 242
A company needs to aggregate and filter a large amount of streaming data in real time with low latency. The company needs to store the data in Amazon S3 for analysis.
Which solution will meet these requirements in the MOST operationally efficient way?
Answer options
- A. Use Amazon Kinesis Data Streams with provisioned capacity and AWS Lambda functions to perform custom transformations and to integrate with Amazon S3.
- B. Use Amazon Data Firehose with built-in data transformations. Deliver the data directly to Amazon S3.
- C. Use Amazon Kinesis Data Streams and Amazon Managed Service for Apache Flink to perform complex processing and to integrate with Amazon S3.
- D. Use Amazon Data Firehose and AWS Lambda functions to perform custom transformations and to deliver the data to Amazon S3.
Correct answer: C
Explanation
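The correct answer is C. Aggregation over a stream is a stateful operation: records must be grouped over time windows before a result can be emitted. Amazon Managed Service for Apache Flink supports windowed aggregation and filtering natively, reads directly from Amazon Kinesis Data Streams, and can write the results to Amazon S3. Because the Flink service is managed, there is no cluster to operate, which keeps the solution operationally efficient. Option A can be made to work, but it depends on custom Lambda code for the aggregation logic and on capacity planning for the provisioned stream, so it carries more operational overhead. Options B and D fall short because Amazon Data Firehose buffers records before delivery, and its built-in transformations cover tasks such as format conversion rather than aggregation. In option D, the Lambda transformation processes each buffered batch independently, which makes aggregation across batches awkward.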
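To make "aggregate and filter" concrete, the following is a minimal, plain-Python sketch of a tumbling-window aggregation, the kind of stateful operation a stream processor such as Flink performs continuously. It is not Flink code and does not call any AWS API. The `Event` fields, the `tumbling_window_sums` function, and the sample values are all hypothetical and exist only for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, NamedTuple, Tuple


class Event(NamedTuple):
    """Hypothetical stream record used only for this illustration."""
    timestamp: float  # event time in seconds
    key: str          # grouping key, for example a device ID
    value: float      # measurement to aggregate


def tumbling_window_sums(
    events: Iterable[Event], window_seconds: int, min_value: float
) -> Dict[Tuple[int, str], float]:
    """Filter out small values, then sum per (window start, key).

    A tumbling window is a fixed-size, non-overlapping time bucket.
    The running sums are the state a stream processor must keep
    between records.
    """
    sums: Dict[Tuple[int, str], float] = defaultdict(float)
    for event in events:
        # Filter step: drop records below the threshold.
        if event.value < min_value:
            continue
        # Assign the record to the window that contains its timestamp.
        window_start = int(event.timestamp // window_seconds) * window_seconds
        # Aggregate step: update the running sum for this window and key.
        sums[(window_start, event.key)] += event.value
    return dict(sums)


# Hypothetical sample data: five records spread over two 5-second windows.
events = [
    Event(0.5, "sensor-a", 10.0),
    Event(1.2, "sensor-a", 0.1),  # removed by the filter
    Event(4.9, "sensor-b", 3.0),
    Event(5.0, "sensor-a", 7.0),
    Event(9.9, "sensor-a", 2.0),
]

result = tumbling_window_sums(events, window_seconds=5, min_value=1.0)
for (window_start, key), total in sorted(result.items()):
    print(window_start, key, total)
```

The sketch processes a finite list, whereas a real stream is unbounded. Handling that case requires durable state, late-arriving records, and checkpointing, which is the work a managed Flink application takes on and which a per-batch Lambda transformation does not provide on its own.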