A company is using Amazon Redshift to build a data warehouse solution. The company is loa…

Question

A company is using Amazon Redshift to build a data warehouse solution. The company is loading hundreds of files into a fact table that is in a Redshift cluster. The company wants the data warehouse solution to achieve the greatest possible throughput. The solution must use cluster resources optimally when the company loads data into the fact table. Which solution will meet these requirements?

Accepted Answer

Correct answer: D. D. Use a single COPY command to load the data into the Redshift cluster. — The correct answer is D, as a single COPY command is optimized for high throughput in Redshift, allowing efficient data loading directly from S3. Options A and C involve multiple commands or INSERT statements, which are less efficient and can lead to bottlenecks. Option B uses HDFS, which adds unnecessary complexity and does not leverage Redshift's strengths in data loading.

AWS Certified Data Engineer – Associate (DEA-C01) — Question 117

Answer options

Correct answer: D

Explanation