Databricks Certified Associate Developer for Apache Spark — Question 195
A Spark engineer must select an appropriate deployment mode for the Spark jobs.
What is the benefit of using cluster mode in Apache Spark™?
Answer options
- A. In cluster mode, resources are allocated from a resource manager on the cluster, enabling better performance and scalability for large jobs.
- B. In cluster mode, the driver is responsible for executing all tasks locally without distributing them across the worker nodes.
- C. In cluster mode, the driver runs on the client machine, which can limit the application’s ability to handle large datasets efficiently.
- D. In cluster mode, the driver program runs on one of the worker nodes, allowing the application to fully utilize the distributed resources of the cluster.
Correct answer: D
Explanation
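The correct answer is D. In cluster mode, the driver is launched inside the cluster (on a worker node, or inside the application master on YARN) rather than on the machine that submitted the job. This places the driver close to the executors, which reduces network latency between them, and lets the application keep running after the submitting client disconnects. Option A is incorrect because executors are allocated by the cluster manager in both client and cluster mode, so it does not describe a benefit specific to cluster mode. Option B is wrong because in cluster mode the driver schedules tasks and the executors on the worker nodes run them; tasks are not executed locally on the driver. Option C is incorrect because it describes client mode, where the driver runs on the submitting machine, not cluster mode.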
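The difference between the two modes comes down to a single `spark-submit` flag, `--deploy-mode`, which accepts `client` or `cluster`. The following is a minimal sketch that assembles the command line for each mode so they can be compared side by side. The helper name `build_spark_submit_command`, the application file `etl_job.py`, and the choice of `yarn` as the master are assumptions made for illustration; they are not part of the question.

```python
import shlex

# The two values accepted by spark-submit's --deploy-mode flag.
VALID_DEPLOY_MODES = ("client", "cluster")


def build_spark_submit_command(app_path, deploy_mode="cluster", master="yarn"):
    """Return the spark-submit argument list for the given deploy mode.

    Hypothetical helper for illustration only: it builds the command
    but does not run it, so no Spark installation is required.
    """
    if deploy_mode not in VALID_DEPLOY_MODES:
        raise ValueError(
            f"deploy_mode must be one of {VALID_DEPLOY_MODES}, got {deploy_mode!r}"
        )
    return [
        "spark-submit",
        "--master", master,            # cluster manager to connect to (assumed: YARN)
        "--deploy-mode", deploy_mode,  # where the driver process is launched
        app_path,                      # hypothetical application file
    ]


if __name__ == "__main__":
    # Print both variants; only the --deploy-mode value differs.
    for mode in VALID_DEPLOY_MODES:
        print(shlex.join(build_spark_submit_command("etl_job.py", deploy_mode=mode)))
```

With `--deploy-mode cluster` the driver is started inside the cluster, while `--deploy-mode client` keeps it on the submitting machine. In both cases the executors are still requested from the same cluster manager, which is why option A does not distinguish the two modes.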