Databricks Certified Data Engineer Associate — Question 77

A data engineer has a Job with multiple tasks that runs nightly. Each of the tasks runs slowly because the clusters take a long time to start.

Which action can the data engineer perform to improve the start up time for the clusters used for the Job?

Answer options

Correct answer: D

Explanation

Using clusters from a cluster pool allows for quicker access to pre-existing clusters, significantly reducing startup time. The other options do not address the startup delay directly; jobs clusters might help with resource allocation but still require starting up, while autoscaling configurations and SQL endpoints do not impact the cluster initialization speed.