Databricks Certified Data Engineer Associate — Question 143
Differentiate between all-purpose clusters and jobs clusters.
A data engineering team has created a python notebook to load data from cloud storage, this job has been tested and now needs to be scheduled in production.
Which would be the best cluster to be used in this case?
Answer options
- A. All purpose cluster
- B. Any Unity Catalog-enabled cluster
- C. Jobs Cluster
- D. Serverless SQL warehouse
Correct answer: C
Explanation
A Jobs Cluster is specifically designed for running scheduled jobs efficiently, making it the best choice for the production scheduling of the Python notebook. In contrast, an All purpose cluster is intended for interactive use and may not be optimized for job execution, while the other options do not specifically cater to the requirements of running scheduled jobs.