AWS Certified Data Analytics – Specialty — Question 38

A company wants to improve the data load time of a sales data dashboard. Data has been collected as .csv files and stored within an Amazon S3 bucket that is partitioned by date. The data is then loaded to an Amazon Redshift data warehouse for frequent analysis. The data volume is up to 500 GB per day.
Which solution will improve the data loading performance?

Answer options

Correct answer: B

Explanation

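Option B is correct because the COPY command loads data in parallel. When the .csv files are split into multiple smaller files, Amazon Redshift spreads them across the slices of the cluster and loads them concurrently, which is far faster than row-by-row INSERT statements or a COPY from a single large file. AWS recommends that the number of files be a multiple of the number of slices in the cluster. The other options either do not optimize data loading effectively or introduce additional steps that can slow down the process.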
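
The approach in option B can be sketched in Python as follows. This is a minimal illustration, not a definitive implementation: it splits CSV text into several parts and builds a COPY statement that points at a shared S3 key prefix, so that Redshift can pick up every part in one parallel load. The table name, bucket, prefix, IAM role ARN, and the part count of 4 are all hypothetical placeholders, and the sketch assumes the files have no header row. Uploading the parts to S3 and executing the statement are left out.

```python
import csv
import io


def split_csv(text, num_parts):
    """Split CSV text (assumed to have no header row) into num_parts CSV
    strings, assigning rows round-robin so the parts are similar in size."""
    buffers = [io.StringIO() for _ in range(num_parts)]
    writers = [csv.writer(b, lineterminator="\n") for b in buffers]
    for i, row in enumerate(csv.reader(io.StringIO(text))):
        writers[i % num_parts].writerow(row)
    return [b.getvalue() for b in buffers]


def build_copy_statement(table, s3_prefix, iam_role):
    """Build a COPY statement that loads every object whose key starts
    with s3_prefix. All arguments here are illustrative placeholders."""
    return (
        f"COPY {table}\n"
        f"FROM '{s3_prefix}'\n"
        f"IAM_ROLE '{iam_role}'\n"
        "FORMAT AS CSV;"
    )


if __name__ == "__main__":
    # Hypothetical: 4 parts, chosen as a multiple of an assumed slice count.
    parts = split_csv("1,widget,9.99\n2,gadget,4.50\n3,gizmo,7.25\n", 4)
    for n, part in enumerate(parts):
        print(f"part-{n:04d}.csv holds {len(part.splitlines())} row(s)")

    print(build_copy_statement(
        "sales",                                       # hypothetical table
        "s3://example-bucket/sales/2024-01-01/part-",  # hypothetical prefix
        "arn:aws:iam::123456789012:role/ExampleRole",  # hypothetical role
    ))
```

Pointing COPY at a key prefix rather than a single object is what lets one statement load all the parts. For 500 GB per day, the part count would be chosen from the cluster's actual slice count rather than the fixed value used here.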