AWS Certified Data Engineer – Associate (DEA-C01) — Question 258
Files from multiple data sources arrive in an Amazon S3 bucket on a regular basis. A data engineer wants to ingest new files into Amazon Redshift in near real time when the new files arrive in the S3 bucket.
Which solution will meet these requirements?
Answer options
- A. Use the query editor v2 to schedule a COPY command to load new files into Amazon Redshift.
- B. Use the zero-ETL integration between Amazon Aurora and Amazon Redshift to load new files into Amazon Redshift.
- C. Use AWS Glue job bookmarks to extract, transform, and load (ETL) load new files into Amazon Redshift.
- D. Use S3 Event Notifications to invoke an AWS Lambda function that loads new files into Amazon Redshift.
Correct answer: D
Explanation
The correct answer is D because using S3 Event Notifications triggers an AWS Lambda function whenever new files are uploaded, allowing for immediate ingestion into Amazon Redshift. Options A and C involve scheduled or batch processes, which do not provide near real-time data loading. Option B is not applicable as it pertains to integration between Amazon Aurora and Amazon Redshift rather than directly loading files from S3.