SnowPro Advanced: Data Engineer — Question 94
A Data Engineer is developing a data pipeline to ingest data from an Amazon S3 bucket into a Snowflake table.
Files will be loaded into the S3 bucket randomly, with most files loaded during working hours. The files are in a compressed CSV format (*.csv.gz) and average 100 MB per file.
The data ingestion requirements are listed in order of priority:
1. MOST cost-effective method
2. LEAST amount of operational overhead
3. New data MUST be available within 10 minutes of the creation of a file
Which ingestion method will meet these requirements?
Answer options
- A. Snowpipe
- B. Snowpipe Streaming
- C. Snowflake Kafka Connector
- D. The COPY INTO {location} command
Correct answer: A
Explanation
Snowpipe is the best choice because it is designed for continuous data ingestion with minimal operational overhead and is cost-effective, charging based on the amount of data ingested. Snowpipe Streaming is typically used for real-time data, which may introduce higher costs and complexity. The Snowflake Kafka Connector is more suited for streaming data from Kafka rather than S3, and the COPY INTO command does not guarantee the 10-minute availability requirement.