Google Cloud Professional Data Engineer — Question 50

You are implementing security best practices on your data pipeline. Currently, you are manually executing jobs as the Project Owner. You want to automate these jobs by taking nightly batch files containing non-public information from Google Cloud Storage, processing them with a Spark Scala job on a Google Cloud
Dataproc cluster, and depositing the results into Google BigQuery.
How should you securely run this workload?

Answer options

Correct answer: C

Explanation

The correct answer is C because using a service account with the appropriate permissions ensures that the workload can run automatically and securely without exposing sensitive information. Option A does not allow for automation, while B grants excessive permissions, and D uses a user account that does not have enough permissions to write to BigQuery.