AWS Certified Solutions Architect – Associate (SAA-C02) — Question 613
A company wants to build a data lake on AWS from data that is stored in an on-premises Oracle relational database. The data lake must receive ongoing updates from the on-premises database.
Which solution will meet these requirements with the LEAST operational overhead?
Answer options
- A. Use AWS DataSync to transfer the data to Amazon S3. Use AWS Glue to transform the data and integrate the data into a data lake.
- B. Use AWS Snowball to transfer the data to Amazon S3. Use AWS Batch to transform the data and integrate the data into a data lake.
- C. Use AWS Database Migration Service (AWS DMS) to transfer the data to Amazon S3. Use AWS Glue to transform the data and integrate the data into a data lake.
- D. Use an Amazon EC2 instance to transfer the data to Amazon S3. Configure the EC2 instance to transform the data and integrate the data into a data lake.
Correct answer: C
Explanation
AWS Database Migration Service (AWS DMS) is the optimal choice for capturing ongoing changes (Change Data Capture) from an on-premises Oracle database and replicating them to Amazon S3 with minimal operational overhead. Using AWS Glue, a serverless data integration service, allows the data to be transformed and organized into the S3 data lake without managing infrastructure. Other options like AWS DataSync, AWS Snowball, or custom EC2 instances either lack native database replication features or require significant manual configuration and maintenance.