Databricks Certified Data Engineer Associate — Question 18

A data engineer has a Job with multiple tasks that runs nightly. Each of the tasks runs slowly because the clusters take a long time to start.
Which of the following actions can the data engineer perform to improve the start up time for the clusters used for the Job?

Answer options

Correct answer: D

Explanation

Choosing clusters from a cluster pool allows for faster startup times because the clusters are pre-provisioned and can be allocated quickly. Options A, B, and C do not specifically address the issue of startup time, while option E might improve performance but not the initialization speed of the clusters.