SnowPro Advanced: Data Engineer — Question 113
A company has deployed a data pipeline that streams customer transaction data from an on-premises PostgreSQL database to Snowflake for real-time analytics. A Data Engineer needs to configure a connector that will continuously and reliably transfer data to Snowflake.
Which connector configuration will meet these requirements, while providing MINIMAL latency, and OPTIMAL performance?
Answer options
- A. Configure the Snowflake Connector for Kafka to continuously stream data from the PostgreSQL database into Snowflake by configuring PostgreSQL as a Kafka producer.
- B. Deploy the Snowflake JDBC driver to connect the PostgreSQL database directly to Snowflake, setting up a continuous query on the PostgreSQL database to push data to Snowflake.
- C. Install the Snowflake Connector for Python on the PostgreSQL server to stream data directly to Snowflake, using Python scripts to manage the data transfer.
- D. Configure the Snowflake Connector for Kafka on a Kafka Connect cluster to stream data from the PostgreSQL database to Kafka topics, which will be ingested using the Kafka connector.
Correct answer: D
Explanation
The correct answer is D because configuring the Snowflake Connector for Kafka on a Kafka Connect cluster allows for efficient streaming of data with minimal latency and optimal performance. Options A and C do not leverage the full capabilities of Kafka, which is designed for high throughput data streaming, while option B does not provide the same level of continuous integration and performance as the Kafka setup.