Google Cloud Professional Cloud Architect — Question 44
Your company has successfully migrated to the cloud and wants to analyze their data stream to optimize operations. They do not have any existing code for this analysis, so they are exploring all their options. These options include a mix of batch and stream processing, as they are running some hourly jobs and live- processing some data as it comes in.
Which technology should they use for this?
Answer options
- A. Google Cloud Dataproc
- B. Google Cloud Dataflow
- C. Google Container Engine with Bigtable
- D. Google Compute Engine with Google BigQuery
Correct answer: B
Explanation
Google Cloud Dataflow is the most suitable choice as it is designed for both batch and stream processing, allowing real-time data analysis. In contrast, Google Cloud Dataproc is more suited for batch processing with Hadoop, Google Container Engine with Bigtable focuses on container management and NoSQL storage, and Google Compute Engine with Google BigQuery is primarily aimed at data warehousing rather than real-time processing capabilities.