Google Cloud Professional Data Engineer — Question 51

You are designing storage for 20 TB of text files as part of deploying a data pipeline on Google Cloud. Your input data is in CSV format. You want to minimize the cost of querying aggregate values for multiple users who will query the data in Cloud Storage with multiple engines. Which storage service and schema design should you use?

Answer options

Correct answer: C

Explanation

The correct answer is C because using Cloud Storage with permanent tables in BigQuery allows for efficient querying of large datasets while minimizing costs. Options A and B involve Cloud Bigtable, which is not cost-effective for this scenario, and option D suggests temporary tables, which are less suitable for repeated queries by multiple users.