Google Cloud Professional Data Engineer — Question 267

You issue a new batch job to Dataflow. The job starts successfully, processes a few elements, and then suddenly fails and shuts down. You navigate to the
Dataflow monitoring interface where you find errors related to a particular DoFn in your pipeline. What is the most likely cause of the errors?

Answer options

Correct answer: B

Explanation

The most likely reason for the errors is 'Exceptions in worker code', as this indicates there is an issue with the code within the worker that processes the data. Other options like 'Job validation' and 'Graph or pipeline construction' are typically checked before the job runs and wouldn't cause a failure after the job has started. 'Insufficient permissions' would generally prevent the job from starting at all, rather than causing it to fail mid-execution.