Google Cloud Professional Data Engineer — Question 55
You want to analyze hundreds of thousands of social media posts daily at the lowest cost and with the fewest steps.
You have the following requirements:
✑ You will batch-load the posts once per day and run them through the Cloud Natural Language API.
✑ You will extract topics and sentiment from the posts.
✑ You must store the raw posts for archiving and reprocessing.
✑ You will create dashboards to be shared with people both inside and outside your organization.
You need to store both the data extracted from the API to perform analysis as well as the raw social media posts for historical archiving. What should you do?
Answer options
- A. Store the social media posts and the data extracted from the API in BigQuery.
- B. Store the social media posts and the data extracted from the API in Cloud SQL.
- C. Store the raw social media posts in Cloud Storage, and write the data extracted from the API into BigQuery.
- D. Feed the social media posts into the API directly from the source, and write the extracted data from the API into BigQuery.
Correct answer: C
Explanation
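Option C is correct because it uses each service for what it does best. Cloud Storage is low-cost object storage, so it suits archiving the raw posts and rereading them for reprocessing. BigQuery is a serverless data warehouse, so the extracted topics and sentiment can be queried with SQL and connected to dashboards. Option A keeps the raw archive in the analytics warehouse, although Cloud Storage's colder storage classes are the cheaper place for data that is rarely read. Option B relies on Cloud SQL, a managed relational database built for transactional workloads rather than analytical queries over a growing history of posts. Option D never stores the raw posts, so it fails the archiving and reprocessing requirement.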
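
The flow behind option C can be sketched as follows. This is a minimal illustration under stated assumptions, not a reference implementation: the object layout, row fields, and function names (`raw_object_name`, `to_row`, `run_daily_batch`) are hypothetical, each post is assumed to be a dict with `id` and `text` keys, and entity names are used only as a rough stand-in for "topics". It also assumes the `google-cloud-storage`, `google-cloud-language`, and `google-cloud-bigquery` client libraries are installed and credentials are configured.

```python
import datetime as dt
import json


def raw_object_name(batch_date):
    """Date-partitioned object path for one day's raw posts (hypothetical layout)."""
    return f"raw/{batch_date:%Y/%m/%d}/posts.jsonl"


def to_row(post_id, raw_uri, score, magnitude, topics):
    """Shape one analysis result as a BigQuery row that points back to the raw archive."""
    return {
        "post_id": post_id,
        "raw_uri": raw_uri,
        "sentiment_score": score,
        "sentiment_magnitude": magnitude,
        "topics": topics,
    }


def run_daily_batch(posts, batch_date, bucket_name, table_id):
    """Archive raw posts in Cloud Storage, analyze them, load results into BigQuery."""
    # Imported here so the pure helpers above work without the client libraries.
    from google.cloud import bigquery, language_v1, storage

    # 1. Archive the raw posts as newline-delimited JSON in Cloud Storage.
    object_name = raw_object_name(batch_date)
    payload = "\n".join(json.dumps(p) for p in posts)
    blob = storage.Client().bucket(bucket_name).blob(object_name)
    blob.upload_from_string(payload, content_type="application/x-ndjson")
    raw_uri = f"gs://{bucket_name}/{object_name}"

    # 2. Run each post through the Cloud Natural Language API.
    nl = language_v1.LanguageServiceClient()
    rows = []
    for post in posts:
        doc = language_v1.Document(
            content=post["text"],
            type_=language_v1.Document.Type.PLAIN_TEXT,
        )
        sentiment = nl.analyze_sentiment(request={"document": doc}).document_sentiment
        # Assumption: entity names approximate topics for short posts.
        entities = nl.analyze_entities(request={"document": doc}).entities
        rows.append(
            to_row(
                post["id"],
                raw_uri,
                sentiment.score,
                sentiment.magnitude,
                [e.name for e in entities],
            )
        )

    # 3. Load the extracted data into BigQuery with one batch load job.
    #    Assumes the destination table exists with a matching schema,
    #    or that schema autodetection is acceptable.
    bigquery.Client().load_table_from_json(rows, table_id).result()
```

Because the posts arrive once per day, the sketch uses a single load job rather than streaming inserts. Storing `raw_uri` in each row keeps the link from an analyzed record back to the archived file it came from, which is what makes later reprocessing straightforward.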