SnowPro Advanced: Data Engineer — Question 33
A Data Engineer is working on a continuous data pipeline which receives data from Amazon Kinesis Firehose and loads the data into a staging table which will later be used in the data transformation process. The average file size is 300-500 MB.
The Engineer needs to ensure that Snowpipe is performant while minimizing costs.
How can this be achieved?
Answer options
- A. Increase the size of the virtual warehouse used by Snowpipe.
- B. Split the files before loading them and set the SIZE_LIMIT option to 250 MB.
- C. Change the file compression size and increase the frequency of the Snowpipe loads.
- D. Decrease the buffer size to trigger delivery of files sized between 100 and 250 MB in Kinesis Firehose.
Correct answer: D
Explanation
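The correct answer is D. Snowflake recommends load files of roughly 100 to 250 MB compressed, and its Snowpipe guidance for Kinesis Firehose is to tune the delivery stream's buffer size (the target file size) and buffer interval so that files land in that range. At 300-500 MB the current files are larger than recommended, so lowering the buffer size makes Firehose deliver smaller files sooner, which lets Snowpipe load them in parallel and with lower latency. Keeping files at about 100 MB or more also avoids the opposite problem: Snowpipe charges a per-file overhead on top of compute, so a flood of very small files raises cost.
Option A is incorrect because Snowpipe runs on Snowflake-managed serverless compute, not a user-managed virtual warehouse, so there is no warehouse to resize. Option B is incorrect because SIZE_LIMIT is a COPY INTO option that caps the total amount of data a statement loads; it does not split files, and it is not supported in a pipe definition. Splitting files in a separate step would also add work that Firehose can do at the source. Option C is incorrect because changing the compression does not by itself bring files into the recommended range, and loading more frequently tends to increase the per-file overhead rather than reduce cost.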
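The file-size trade-off can be made concrete with a little arithmetic. The following is a minimal sketch, not a billing calculator: the function names are hypothetical, the 100 to 250 MB range is the target named in option D, and the default rate of 0.06 credits per 1,000 files is an assumption that should be checked against current Snowflake pricing documentation. Compute charges for the loads themselves are not modeled.

```python
import math

# Target range for load files, in MB (the range named in option D).
RECOMMENDED_MIN_MB = 100
RECOMMENDED_MAX_MB = 250


def in_recommended_range(file_size_mb: float) -> bool:
    """Return True if a file size falls inside the 100-250 MB target range."""
    return RECOMMENDED_MIN_MB <= file_size_mb <= RECOMMENDED_MAX_MB


def files_per_day(daily_volume_mb: float, file_size_mb: float) -> int:
    """Number of files needed to deliver a daily volume at a given file size."""
    return math.ceil(daily_volume_mb / file_size_mb)


def per_file_overhead_credits(
    file_count: int, credits_per_1000_files: float = 0.06
) -> float:
    """Per-file overhead only; the default rate is an assumed value."""
    return file_count / 1000 * credits_per_1000_files


if __name__ == "__main__":
    daily_volume_mb = 100_000  # hypothetical daily volume, for illustration only
    for size_mb in (400, 200, 1):
        count = files_per_day(daily_volume_mb, size_mb)
        print(
            size_mb,
            in_recommended_range(size_mb),
            count,
            per_file_overhead_credits(count),
        )
```

Under these assumptions, moving from 400 MB to 200 MB files doubles the file count but keeps the per-file overhead small, whereas shrinking files to 1 MB multiplies it by several hundred. That is why option D targets a range rather than simply "smaller files".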