AWS Certified Data Engineer – Associate (DEA-C01) — Question 253

A company maintains a data warehouse in an on-premises Oracle database. The company wants to build a data lake on AWS. The company wants to load data warehouse tables into Amazon S3 and synchronize the tables with incremental data that arrives from the data warehouse every day.

Each table has a column that contains monotonically increasing values. The size of each table is less than 50 GB. The data warehouse tables are refreshed every night between 1 AM and 2 AM. A business intelligence team queries the tables between 10 AM and 8 PM every day.

Which solution will meet these requirements in the MOST operationally efficient way?

Answer options

Correct answer: B

Explanation

Option B is correct because it uses a job bookmark keyed on the monotonically increasing column to track which rows have already been loaded, so each daily run reads only the rows added since the previous run and appends them to the existing data. The other options either do not take advantage of incremental loading or overwrite the existing data, which is less efficient and risks data loss or downtime for the BI team.
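
To illustrate the idea behind a bookmark on a monotonically increasing column, here is a minimal sketch in plain Python. It is not the AWS Glue API and not the wording of option B; the `Bookmark` class, the `incremental_load` function, and the `row_id` column name are all hypothetical names chosen for this example. It only shows the high-water-mark logic: remember the largest key value already loaded, then append rows above it.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Bookmark:
    """Hypothetical stand-in for persisted bookmark state."""
    last_value: Optional[int] = None  # highest key value already loaded


def incremental_load(source_rows, target_rows, bookmark, key="row_id"):
    """Append only rows whose key exceeds the bookmark; return the count."""
    if bookmark.last_value is None:
        new_rows = list(source_rows)  # first run: full load
    else:
        new_rows = [r for r in source_rows if r[key] > bookmark.last_value]
    target_rows.extend(new_rows)  # append, never overwrite
    if new_rows:
        # Advance the bookmark to the largest key just loaded.
        bookmark.last_value = max(r[key] for r in new_rows)
    return len(new_rows)


# Simulated nightly runs against an in-memory "warehouse" and "lake".
warehouse = [{"row_id": 1, "amount": 10}, {"row_id": 2, "amount": 20}]
lake = []
bookmark = Bookmark()

first_run = incremental_load(warehouse, lake, bookmark)   # initial full load

warehouse.append({"row_id": 3, "amount": 30})             # next nightly refresh
second_run = incremental_load(warehouse, lake, bookmark)  # only the new row
```

Because each run only appends, data already in the target stays in place, which is the property the explanation contrasts with overwriting. In this sketch the bookmark lives in memory; a real job would need to persist it between runs.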