SnowPro Advanced: Architect — Question 146
An Architect has designed a data pipeline that is receiving small CSV files from multiple sources. All of the files are landing in one location. Specific files are filtered for loading into Snowflake tables using the COPY command. The loading performance is poor.
What changes can be made to improve the data loading performance?
Answer options
- A. Increase the size of the virtual warehouse.
- B. Create a multi-cluster warehouse and merge smaller files to create bigger files.
- C. Create a specific storage landing bucket to avoid file scanning.
- D. Change the file format from CSV to JSON.
Correct answer: B
Explanation
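The best answer is B. Each file loaded by COPY carries a fixed overhead, so with many small files most of the load time goes to that overhead rather than to the data. Merging them into larger files (Snowflake's guidance is roughly 100-250 MB compressed per file) lowers the file count and gives each load thread more useful work. Increasing the size of the virtual warehouse (A) adds load threads but does not remove the per-file overhead. A dedicated landing bucket (C) reduces how many files must be listed and filtered, and changing the file format from CSV to JSON (D) leaves the number of small files unchanged, so neither addresses the root cause as directly as combining files.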
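As a rough illustration of the merging step, the sketch below groups small CSV files into batches near a target size and concatenates each batch into one file, keeping a single header row. It is a minimal sketch, not part of the original question: the names `plan_batches` and `merge_csv` and the 100 MB target are assumptions chosen for illustration, and it assumes every file shares the same columns and that the header occupies exactly one physical line.

```python
from typing import Iterable, List, Tuple

# Assumed target: the lower end of the commonly cited 100-250 MB guidance.
TARGET_BYTES = 100 * 1024 * 1024


def plan_batches(
    files: Iterable[Tuple[str, int]], target_bytes: int = TARGET_BYTES
) -> List[List[str]]:
    """Greedily group (path, size_in_bytes) pairs into batches.

    A batch is closed as soon as adding the next file would push it
    past target_bytes; a single oversized file gets its own batch.
    """
    batches: List[List[str]] = []
    current: List[str] = []
    current_size = 0
    for path, size in files:
        if current and current_size + size > target_bytes:
            batches.append(current)
            current, current_size = [], 0
        current.append(path)
        current_size += size
    if current:
        batches.append(current)
    return batches


def merge_csv(paths: List[str], out_path: str, has_header: bool = True) -> None:
    """Concatenate CSV files into out_path, keeping only the first header."""
    with open(out_path, "w", newline="") as out:
        for file_index, path in enumerate(paths):
            with open(path, newline="") as src:
                for line_index, line in enumerate(src):
                    # Skip the header row of every file after the first.
                    if has_header and line_index == 0 and file_index > 0:
                        continue
                    out.write(line)
                    # Guard against a source file with no trailing newline,
                    # which would otherwise glue two rows together.
                    if not line.endswith(("\n", "\r")):
                        out.write("\n")
```

A pipeline could call `plan_batches` on the listing of the landing location, run `merge_csv` on each batch, and then upload the merged files for COPY to load. Where the merge runs (before landing, or in a separate compaction job) is a design choice the question leaves open.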