AWS Certified Big Data – Specialty — Question 40

A company uses Amazon Redshift for its enterprise data warehouse. A new on-premises PostgreSQL OLTP DB must be integrated into the data warehouse. Each table in the PostgreSQL DB has an indexed timestamp column. The data warehouse has a staging layer to load source data into the data warehouse environment for further processing.
The data lag between the source PostgreSQL DB and the Amazon Redshift staging layer should NOT exceed four hours.
What is the most efficient technique to meet these requirements?

Answer options

Correct answer: C

Explanation

The correct answer is C. The indexed timestamp column on every table makes it cheap to extract only the incremental changes, and splitting those changes across multiple S3 objects lets Amazon Redshift load them in parallel. Option A is less efficient because of the overhead of maintaining a direct connection and triggers. Option B adds the complexity of Kinesis Streams, which is unlikely to be needed when a lag of up to four hours is acceptable. Option D is likely to perform worse when loading large amounts of data than an approach that uses multiple S3 objects.
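
The approach described for option C can be sketched roughly as follows. This is a minimal illustration, not the exam's reference solution: it assumes the incremental changes are selected with a SQL query on the timestamp column and loaded with the Redshift COPY command from a shared S3 key prefix. The function names, table and column names, bucket, and IAM role are all hypothetical, and the database and S3 calls are left out so that the sketch only builds the query, the file split, and the COPY statement.

```python
from datetime import datetime, timezone


def build_extract_query(table, ts_column, last_loaded):
    """Build an incremental SELECT that uses the indexed timestamp column.

    Assumption: table and column names come from trusted configuration.
    The watermark value is passed as a query parameter, not interpolated.
    """
    sql = f"SELECT * FROM {table} WHERE {ts_column} > %s ORDER BY {ts_column}"
    return sql, (last_loaded,)


def split_into_parts(rows, parts):
    """Distribute rows round-robin into `parts` lists of roughly equal size.

    Each list would be written to its own S3 object so that COPY can
    load the objects in parallel.
    """
    buckets = [[] for _ in range(parts)]
    for i, row in enumerate(rows):
        buckets[i % parts].append(row)
    return buckets


def build_copy_command(staging_table, s3_prefix, iam_role):
    """Build a COPY statement that loads every object under one key prefix."""
    return (
        f"COPY {staging_table} FROM '{s3_prefix}' "
        f"IAM_ROLE '{iam_role}' FORMAT AS CSV"
    )


if __name__ == "__main__":
    # Hypothetical watermark: the newest timestamp already in staging.
    watermark = datetime(2024, 1, 1, tzinfo=timezone.utc)
    print(build_extract_query("orders", "updated_at", watermark)[0])
    print([len(p) for p in split_into_parts(list(range(10)), 4)])
    print(build_copy_command(
        "staging.orders",
        "s3://example-bucket/orders/batch-001/part-",
        "arn:aws:iam::123456789012:role/ExampleRedshiftRole",
    ))
```

A scheduler would run these steps at an interval comfortably shorter than four hours and store the new watermark after each successful load. Redshift's loading guidance suggests choosing a number of files that is a multiple of the number of slices in the cluster, so the value passed as `parts` would normally be derived from the cluster size.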