SnowPro Advanced: Data Engineer — Question 55
A Data Engineer needs to ingest invoice data in PDF format into Snowflake so that the data can be queried and used in a forecasting solution.
What is the recommended way to ingest this data?
Answer options
- A. Use Snowpipe to ingest the files that land in an external stage into a Snowflake table.
- B. Use a COPY INTO command to ingest the PDF files in an external stage into a Snowflake table with a VARIANT column.
- C. Create an external table on the PDF files that are stored in a stage and parse the data into structured data.
- D. Create a Java User-Defined Function (UDF) that leverages Java-based PDF parser libraries to parse PDF data into structured data.
Correct answer: D
Explanation
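The correct answer is D. PDF is an unstructured format, so its contents must be parsed before they can be queried or fed into a forecasting solution. A Java UDF can call a Java-based PDF parser library to extract the invoice fields and return them as structured data. Options A, B, and C only load or expose files: Snowpipe and COPY INTO load files in supported structured and semi-structured formats (such as CSV, JSON, or Parquet), and an external table expects those same kinds of formats, so none of them parses PDF content. Among the listed options, only the UDF supplies the transformation layer that PDF data requires.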
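To make the "parse PDF data into structured data" step concrete, here is a minimal sketch of the second half of such a handler: turning text that has already been extracted from an invoice into named fields. It assumes the raw text was produced earlier by a PDF library (for example, Apache PDFBox's PDFTextStripper); that extraction call is left out because it needs an external JAR. The class name InvoiceTextParser, the field labels ("Invoice Number", "Invoice Date", "Total"), and the output keys are all hypothetical and would have to match the real invoice layout.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of any Snowflake API: maps text that a PDF
// parser library has already extracted from an invoice to named fields.
class InvoiceTextParser {

    // Assumed labels; real invoices will need their own patterns.
    private static final Pattern INVOICE_NUMBER =
        Pattern.compile("Invoice Number:\\s*(\\S+)");
    private static final Pattern INVOICE_DATE =
        Pattern.compile("Invoice Date:\\s*(\\d{4}-\\d{2}-\\d{2})");
    // \b keeps "Total:" from matching inside a longer capitalized word.
    private static final Pattern TOTAL =
        Pattern.compile("\\bTotal:\\s*([0-9][0-9,]*(?:\\.[0-9]+)?)");

    // Returns only the fields that were found, in a stable order.
    public static Map<String, String> parse(String text) {
        Map<String, String> fields = new LinkedHashMap<>();
        putIfFound(fields, "invoice_number", INVOICE_NUMBER, text);
        putIfFound(fields, "invoice_date", INVOICE_DATE, text);
        putIfFound(fields, "total", TOTAL, text);

        // Drop thousands separators so the value can be cast to a number later.
        String total = fields.get("total");
        if (total != null) {
            fields.put("total", total.replace(",", ""));
        }
        return fields;
    }

    // Stores the first capture group under the given key when the pattern matches.
    private static void putIfFound(Map<String, String> fields, String key,
                                   Pattern pattern, String text) {
        Matcher matcher = pattern.matcher(text);
        if (matcher.find()) {
            fields.put(key, matcher.group(1));
        }
    }
}
```

In a real UDF, a handler method would first read the staged PDF, pass the extracted text to a routine like parse, and return the result in a form Snowflake can store, such as a JSON string. Fields that are missing from the text are simply absent from the map, which leaves the caller to decide how to treat incomplete invoices.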