Databricks Certified Data Engineer Associate — Question 22
A data engineer has a single-task Job that runs each morning before they begin working. After identifying an upstream data issue, they need to set up another task to run a new notebook prior to the original task.
Which of the following approaches can the data engineer use to set up the new task?
Answer options
- A. They can clone the existing task in the existing Job and update it to run the new notebook.
- B. They can create a new task in the existing Job and then add it as a dependency of the original task.
- C. They can create a new task in the existing Job and then add the original task as a dependency of the new task.
- D. They can create a new job from scratch and add both tasks to run concurrently.
- E. They can clone the existing task to a new Job and then edit it to run the new notebook.
Correct answer: B
Explanation
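The correct answer is B. A task in a Job starts only after the tasks it depends on have completed, so creating a new task for the new notebook and adding it as a dependency of the original task guarantees that the new notebook runs first. Option A is incorrect because cloning the task and pointing the clone at the new notebook does not, by itself, define any ordering between the two tasks. Option C reverses the dependency, so the original task would run before the new notebook. Option D runs the two tasks concurrently, which gives no ordering guarantee. Option E puts the new notebook in a separate Job, which is unnecessary and does not order it ahead of the original task.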
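The dependency described in option B can be sketched as a Job definition. This is a minimal illustration, not the exam's reference solution: the field names `task_key`, `depends_on`, and `notebook_task` are assumed to follow the Databricks Jobs API task schema, while the job name, task keys, and notebook paths are hypothetical. The `run_order` helper is also hypothetical. It only resolves the declared dependencies locally and does not call Databricks.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical job settings: names, task keys, and paths are placeholders.
job_settings = {
    "name": "morning_job",
    "tasks": [
        {
            # New task that runs the new notebook.
            "task_key": "fix_upstream_data",
            "notebook_task": {"notebook_path": "/Repos/example/fix_upstream_data"},
        },
        {
            # Original task, now declaring the new task as its dependency.
            "task_key": "original_task",
            "notebook_task": {"notebook_path": "/Repos/example/original_notebook"},
            "depends_on": [{"task_key": "fix_upstream_data"}],
        },
    ],
}


def run_order(settings):
    """Return task keys in an order that respects every depends_on entry."""
    # Map each task to the set of tasks that must finish before it starts.
    graph = {
        task["task_key"]: {dep["task_key"] for dep in task.get("depends_on", [])}
        for task in settings["tasks"]
    }
    return list(TopologicalSorter(graph).static_order())


order = run_order(job_settings)
print(order)  # ['fix_upstream_data', 'original_task']
```

Moving the `depends_on` entry onto the new task instead (option C) would reverse the resulting order, and omitting it entirely (as in options A and D) would leave the relative order of the two tasks undefined.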