Google Cloud Professional Data Engineer — Question 225

Your team is building a data lake platform on Google Cloud. As part of the data foundation design, you plan to store all raw data in Cloud Storage. You expect to ingest approximately 25 GB of data per day, and your billing department is worried about the increasing cost of storing old data. The current business requirements are:

• The old data can be deleted anytime.
• There is no predefined access pattern of the old data.
• The old data should be available instantly when accessed.
• There should not be any charges for data retrieval.

What should you do to optimize for cost?

Answer options

Correct answer: A

Explanation

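The correct answer is A. With Autoclass enabled on the bucket, Cloud Storage automatically moves each object to a colder or warmer storage class based on that object's own access history, so no access pattern has to be known in advance. Autoclass buckets are not charged retrieval fees or early deletion fees, which satisfies the requirements that old data can be deleted at any time and read without retrieval charges. Every Cloud Storage class serves data with millisecond latency, so old data stays instantly available. The other options rely on fixed lifecycle policies that move data to colder classes on a schedule; reading that data would incur retrieval fees, and deleting it before the class's minimum storage duration would incur early deletion charges.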
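To make the Autoclass behavior concrete, here is a minimal Python sketch of how an object's storage class could be modeled from the number of days since it was last accessed. It is a simplified illustration, not the service's implementation: the function names and the day-granular thresholds (30, 90, and 365 days without access) are assumptions for this sketch, it ignores the rule that very small objects stay in Standard, and it assumes the bucket's terminal storage class is either NEARLINE or ARCHIVE. The ingest helper only multiplies the 25 GB per day from the question.

```python
# Simplified model of Autoclass transitions (illustrative only).
# Classes ordered from warmest to coldest.
CLASSES = ["STANDARD", "NEARLINE", "COLDLINE", "ARCHIVE"]

# Assumed thresholds: days without access before an object moves to each class.
THRESHOLD_DAYS = {"NEARLINE": 30, "COLDLINE": 90, "ARCHIVE": 365}


def autoclass_storage_class(days_since_access: int, terminal: str = "ARCHIVE") -> str:
    """Return the class an object would sit in under this simplified model.

    An object that is read again has days_since_access reset to 0,
    which places it back in STANDARD.
    """
    if terminal not in ("NEARLINE", "ARCHIVE"):
        raise ValueError("terminal must be NEARLINE or ARCHIVE")
    if days_since_access < 0:
        raise ValueError("days_since_access must be non-negative")

    current = "STANDARD"
    # Walk colder classes up to and including the terminal class.
    for name in CLASSES[1 : CLASSES.index(terminal) + 1]:
        if days_since_access >= THRESHOLD_DAYS[name]:
            current = name
    return current


def yearly_ingest_gb(daily_gb: float = 25, days: int = 365) -> float:
    """Raw data ingested over a period at a constant daily rate."""
    return daily_gb * days


if __name__ == "__main__":
    for idle in (0, 45, 120, 400):
        print(idle, autoclass_storage_class(idle))
    print(yearly_ingest_gb())
```

The point of the sketch is that the class depends only on each object's own idle time, which is why Autoclass fits a workload with no predefined access pattern: nothing has to be scheduled up front, and a read simply resets the object to Standard.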