Google Cloud Professional Cloud Architect — Question 49

Your company is forecasting a sharp increase in the number and size of Apache Spark and Hadoop jobs being run on your local datacenter. You want to utilize the cloud to help you scale this upcoming demand with the least amount of operations work and code change.
Which product should you use?

Answer options

Correct answer: B

Explanation

Google Cloud Dataproc is designed specifically for running Apache Spark and Hadoop jobs in a managed environment, allowing for easy scaling with minimal operational overhead. In contrast, Google Cloud Dataflow is more suited for stream and batch data processing, Google Compute Engine provides general-purpose virtual machines, and Google Kubernetes Engine is focused on container orchestration, none of which directly target the specific needs of Spark and Hadoop workloads.