A company uses Amazon Redshift for its enterprise data warehouse. A new on-premises Postg…

Question

A company uses Amazon Redshift for its enterprise data warehouse. A new on-premises PostgreSQL OLTP
DB must be integrated into the data warehouse. Each table in the PostgreSQL DB has an indexed timestamp column. The data warehouse has a staging layer to load source data into the data warehouse environment for further processing.
The data lag between the source PostgreSQL DB and the Amazon Redshift staging layer should NOT exceed four hours.
What is the most efficient technique to meet these requirements?

Accepted Answer

Correct answer: C. C. Extract the incremental changes periodically using a SQL query. Upload the changes to multiple Amazon Simple Storage Service (S3) objects, and run the COPY command to load to the Amazon Redshift staging layer. — The correct answer is C because it allows for efficient extraction of incremental changes and the use of multiple S3 objects can optimize the loading process into Amazon Redshift. Option A is less efficient due to the overhead of maintaining a direct connection and triggers. Option B introduces additional complexity with Kinesis Streams, which may not be necessary. Option D may lead to performance issues when loading large amounts of data compared to using multiple S3 objects.

AWS Certified Big Data – Specialty — Question 40

Answer options

Correct answer: C

Explanation