Databricks Certified Data Engineer Professional — Question 118
A Databricks job has been configured with three tasks, each of which is a Databricks notebook. Task A does not depend on other tasks. Tasks B and C run in parallel, with each having a serial dependency on task A.
What will be the resulting state if tasks A and B complete successfully but task C fails during a scheduled run?
Answer options
- A. All logic expressed in the notebook associated with tasks A and B will have been successfully completed; some operations in task C may have completed successfully.
- B. Unless all tasks complete successfully, no changes will be committed to the Lakehouse; because task C failed, all commits will be rolled back automatically.
- C. Because all tasks are managed as a dependency graph, no changes will be committed to the Lakehouse until all tasks have successfully been completed.
- D. All logic expressed in the notebook associated with tasks A and B will have been successfully completed; any changes made in task C will be rolled back due to task failure.
Correct answer: A
Explanation
The correct answer is A because Tasks A and B complete successfully, meaning their operations are finalized. Task C's failure does not affect the success of Tasks A and B, and thus any operations it may have performed before failing could be valid. Options B and C incorrectly imply a rollback of all changes, while D suggests that Task C's changes would negate the success of Tasks A and B, which is not the case.