AWS Certified Data Engineer – Associate (DEA-C01) — Question 196

A company is using Amazon S3 to build a data lake. The company needs to replicate records from multiple source databases into Apache Parquet format.

Most of the source databases are hosted on Amazon RDS. However, one source database is an on-premises Microsoft SQL Server Enterprise instance. The company needs to implement a solution to replicate existing data from all source databases and all future changes to the target S3 data lake.

Which solution will meet these requirements MOST cost-effectively?

Answer options

Correct answer: C

Explanation

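The correct answer is C. AWS Database Migration Service (AWS DMS) can run the initial full load and ongoing change data capture (CDC) in a single replication task. It supports both Amazon RDS and on-premises Microsoft SQL Server as sources, and it can write to an Amazon S3 target in Apache Parquet format, so one managed service covers every requirement. Options A and B do not handle existing data and future changes with a single unified mechanism, which adds cost and operational overhead. Option D adds Kinesis to capture future changes, which is unnecessary complexity when a single service can do both.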
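
As a rough illustration of the single-service approach, the sketch below builds the parameters that would be passed to the boto3 DMS client for an S3 target endpoint that writes Parquet and for a task that combines full load with CDC. The function names, bucket name, and ARNs are hypothetical placeholders; only the parameter keys and values (S3Settings, DataFormat, MigrationType, TableMappings) come from the DMS API. The block uses only the standard library, so it makes no AWS calls.

```python
import json


def s3_target_settings(bucket: str, role_arn: str) -> dict:
    """Build S3Settings for a DMS target endpoint that writes Parquet files."""
    return {
        "BucketName": bucket,              # hypothetical data lake bucket
        "ServiceAccessRoleArn": role_arn,  # hypothetical IAM role DMS assumes
        "DataFormat": "parquet",           # the DMS default is csv
    }


def table_mappings(schema: str = "%") -> str:
    """Return a selection rule (as a JSON string) that includes every table."""
    rules = {
        "rules": [
            {
                "rule-type": "selection",
                "rule-id": "1",
                "rule-name": "include-all",
                "object-locator": {"schema-name": schema, "table-name": "%"},
                "rule-action": "include",
            }
        ]
    }
    return json.dumps(rules)


def replication_task_params(
    task_id: str, source_arn: str, target_arn: str, instance_arn: str
) -> dict:
    """Build parameters for a task that loads existing data, then streams changes."""
    return {
        "ReplicationTaskIdentifier": task_id,
        "SourceEndpointArn": source_arn,
        "TargetEndpointArn": target_arn,
        "ReplicationInstanceArn": instance_arn,
        "MigrationType": "full-load-and-cdc",  # existing data plus future changes
        "TableMappings": table_mappings(),
    }
```

With credentials configured, these dictionaries could be passed to `boto3.client("dms").create_endpoint(EndpointIdentifier=..., EndpointType="target", EngineName="s3", S3Settings=...)` and `create_replication_task(**params)`. Under this sketch, each source database would get its own source endpoint and task, all writing to the same S3 target.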