Google Cloud Professional Data Engineer — Question 55

You want to analyze hundreds of thousands of social media posts daily at the lowest cost and with the fewest steps.
You have the following requirements:
✑ You will batch-load the posts once per day and run them through the Cloud Natural Language API.
✑ You will extract topics and sentiment from the posts.
✑ You must store the raw posts for archiving and reprocessing.
✑ You will create dashboards to be shared with people both inside and outside your organization.
You need to store both the data extracted from the API, for analysis, and the raw social media posts, for historical archiving. What should you do?

Answer options

Correct answer: C

Explanation

Option C is correct because it stores each kind of data where it fits best. The raw posts go to Cloud Storage, a low-cost object store suited to archiving and later reprocessing. The topics and sentiment returned by the Cloud Natural Language API go to BigQuery, where they can be queried for analysis and used to build the dashboards shared inside and outside the organization. Options A and B do not separate raw-data storage from the processed data, which can complicate archiving and reprocessing. Option D does not meet the archiving requirement because it does not store the raw posts separately.
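
The flow described for option C can be sketched roughly as follows. This is a minimal illustration, not a reference implementation. It assumes the daily batch is a newline-delimited JSON object in Cloud Storage whose records have "id" and "text" fields, and it uses entity names as a stand-in for "topics". The bucket name, object path layout, table ID, and row field names are all hypothetical.

```python
import json
from datetime import date


def raw_object_name(load_date, batch_id):
    """Hypothetical Cloud Storage object path for one daily batch of raw posts."""
    return f"raw/{load_date:%Y/%m/%d}/{batch_id}.jsonl"


def to_bigquery_row(post_id, gcs_uri, score, magnitude, topics):
    """Build one analysis row; the field names are assumptions, not a fixed schema."""
    return {
        "post_id": post_id,
        "raw_uri": gcs_uri,  # pointer back to the archived raw post
        "sentiment_score": score,
        "sentiment_magnitude": magnitude,
        "topics": topics,
    }


def analyze_and_load(bucket_name, object_name, table_id):
    """Read archived raw posts, analyze them, and batch-load the results."""
    # Imported here so the helpers above run without the client libraries.
    from google.cloud import bigquery, language_v1, storage

    # 1. The raw posts are already archived in Cloud Storage; read them back.
    blob = storage.Client().bucket(bucket_name).blob(object_name)
    gcs_uri = f"gs://{bucket_name}/{object_name}"

    # 2. Run each post through the Cloud Natural Language API.
    nl_client = language_v1.LanguageServiceClient()
    rows = []
    for line in blob.download_as_text().splitlines():
        post = json.loads(line)
        document = language_v1.Document(
            content=post["text"],
            type_=language_v1.Document.Type.PLAIN_TEXT,
        )
        sentiment = nl_client.analyze_sentiment(
            request={"document": document}
        ).document_sentiment
        entities = nl_client.analyze_entities(
            request={"document": document}
        ).entities
        rows.append(
            to_bigquery_row(
                post["id"],
                gcs_uri,
                sentiment.score,
                sentiment.magnitude,
                [entity.name for entity in entities],
            )
        )

    # 3. Write only the extracted data to BigQuery with a batch load job.
    job = bigquery.Client().load_table_from_json(rows, table_id)
    job.result()  # wait for the load job to finish
    return len(rows)
```

A daily run might call analyze_and_load("my-archive-bucket", raw_object_name(date.today(), "batch-001"), "my-project.social.post_analysis"), where all three arguments are placeholder values. Keeping the raw object in Cloud Storage means the same function can be rerun later if the posts need reprocessing.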