Your organization has a petabyte of application logs stored as Parquet files in Cloud Sto…

Question

Your organization has a petabyte of application logs stored as Parquet files in Cloud Storage. You need to quickly perform a one-time SQL-based analysis of the files and join them to data that already resides in BigQuery. What should you do?

Accepted Answer

Correct answer: C. C. Create external tables over the files in Cloud Storage, and perform SQL joins to tables in BigQuery to analyze the data. — The correct answer is C because creating external tables allows you to query the Parquet files directly in Cloud Storage using SQL, which is efficient for one-time analyses. Option A requires the overhead of managing a Dataproc cluster, B involves more setup with Cloud Data Fusion, and D necessitates loading data into BigQuery, which is unnecessary for a one-time analysis.

Google Cloud Associate Data Practitioner — Question 39

Answer options

Correct answer: C

Explanation