Google Cloud Professional Data Engineer — Question 18

You are designing storage for very large text files for a data pipeline on Google Cloud. You want to support ANSI SQL queries. You also want to support compression and parallel load from the input locations using Google recommended practices. What should you do?

Answer options

Correct answer: B

Explanation

The correct answer is B because it combines the use of Cloud Dataflow to transform text files into a compressed format, and utilizes both Cloud Storage and BigQuery permanent linked tables, which are optimal for ANSI SQL queries. Option A incorrectly suggests using only BigQuery for storage, while options C and D do not align with recommended practices for handling large text files with ANSI SQL support.