AWS Certified Data Engineer – Associate (DEA-C01) — Question 207
A company is building a new application that ingests CSV files into Amazon Redshift. The company has developed the frontend for the application.
The files are stored in an Amazon S3 bucket. Files are no larger than 5 MB.
A data engineer is developing the extract, transform, and load (ETL) pipeline for the CSV files. The data engineer configured a Redshift cluster and an AWS Lambda function that copies the data out of the files into the Redshift cluster.
Which additional steps should the data engineer perform to meet these requirements?
Answer options
- A. Configure the bucket to send S3 event notifications to Amazon EventBridge. Configure an EventBridge rule that matches S3 new object created events. Set the Lambda function as the target.
- B. Configure the S3 bucket to send S3 event notifications to an Amazon Simple Queue Service (Amazon SQS) queue. Configure the Lambda function to process the queue.
- C. Configure AWS Database Migration Service (AWS DMS) to stream new S3 objects to a data stream in Amazon Kinesis Data Streams. Set the Lambda function as the target of the data stream.
- D. Configure an Amazon EventBridge rule that matches S3 new object created events. Set an Amazon Simple Queue Service (Amazon SQS) queue as the target of the rule. Configure the Lambda function to process the queue.
Correct answer: B
Explanation
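The correct answer is B. Amazon S3 can publish event notifications directly to an SQS queue, and Lambda can poll that queue through an event source mapping. The queue buffers bursts of uploads, lets Lambda process messages in batches, and keeps failed messages available for retry, so the CSV files are loaded asynchronously as they arrive in the S3 bucket. Option A would invoke the function for each new object, but it places no queue between S3 and Lambda to absorb bursts, which is why B is preferred here. Option C adds unnecessary complexity: AWS DMS and Kinesis Data Streams are not needed to react to small files landing in S3. Option D is incomplete: an SQS queue is a valid target for an EventBridge rule, but the option never configures the bucket to send its events to EventBridge, so the rule would have no S3 events to match.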
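To make option B concrete, here is a minimal sketch of how the Lambda function might unpack the messages it receives from the queue. It assumes the standard shape of an SQS-triggered Lambda event, where each message body is the JSON text of an S3 event notification. The function names, the `staging.events` table, the IAM role ARN, and the `IGNOREHEADER 1` setting are all hypothetical placeholders, and actually running the generated statement against Redshift is left out.

```python
import json
from urllib.parse import unquote_plus

# Hypothetical placeholders; replace with real values for an actual pipeline.
TARGET_TABLE = "staging.events"
COPY_ROLE_ARN = "arn:aws:iam::123456789012:role/HypotheticalCopyRole"


def extract_s3_objects(sqs_event):
    """Return (bucket, key) pairs found in an SQS-triggered Lambda event."""
    objects = []
    for record in sqs_event.get("Records", []):
        # Each SQS message body carries one S3 notification as JSON text.
        body = json.loads(record["body"])
        # The s3:TestEvent sent when notifications are first configured
        # has no "Records" key, so it is skipped here.
        for s3_record in body.get("Records", []):
            if not s3_record.get("eventName", "").startswith("ObjectCreated"):
                continue
            bucket = s3_record["s3"]["bucket"]["name"]
            # Object keys arrive URL-encoded (for example, spaces as "+").
            key = unquote_plus(s3_record["s3"]["object"]["key"])
            objects.append((bucket, key))
    return objects


def build_copy_statement(bucket, key, table, iam_role_arn):
    """Build a Redshift COPY statement for one CSV object.

    Assumes the file has a header row and that the key contains no
    single quotes; a real pipeline should validate or escape the key.
    """
    return (
        f"COPY {table} FROM 's3://{bucket}/{key}' "
        f"IAM_ROLE '{iam_role_arn}' FORMAT AS CSV IGNOREHEADER 1;"
    )


def handler(event, context):
    """Lambda entry point: turn queued S3 notifications into COPY statements."""
    statements = [
        build_copy_statement(bucket, key, TARGET_TABLE, COPY_ROLE_ARN)
        for bucket, key in extract_s3_objects(event)
    ]
    # Executing the statements (for example through the Redshift Data API)
    # is intentionally omitted from this sketch.
    return {"statements": statements}
```

Because the function only reads from the queue, a burst of uploads accumulates as messages rather than as simultaneous invocations, and a message whose load fails becomes visible again for another attempt.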