Databricks Certified Associate Developer for Apache Spark — Question 1
Which of the following statements about Spark’s stability is incorrect?
Answer options
- A. Spark is designed to support the loss of any set of worker nodes.
- B. Spark will rerun any failed tasks due to failed worker nodes.
- C. Spark will recompute data cached on failed worker nodes.
- D. Spark will spill data to disk if it does not fit in memory.
- E. Spark will reassign the driver to a worker node if the driver’s node fails.
Correct answer: E
Explanation
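The correct answer is E. The driver is a single point of failure for a Spark application: it holds the SparkContext and schedules every task, so if the driver's node fails, the application fails with it. Spark does not move a running driver onto a worker node. Some deployment modes can relaunch a failed driver (for example, the --supervise flag of spark-submit in standalone cluster mode), but that starts the driver again rather than reassigning it to a worker.
Options A, B, C, and D correctly describe Spark's fault tolerance and memory management:
- A. Executors on worker nodes are replaceable, so the application can continue as long as the driver survives and the cluster still has resources to run tasks.
- B. Tasks that fail because their worker was lost are rescheduled on other executors.
- C. Cached partitions that lived on a failed worker are rebuilt from their lineage the next time they are needed.
- D. Data that does not fit in memory can be spilled to disk, for example during shuffles or when caching with a storage level that includes disk.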
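Option C is easier to see with a small model. The sketch below is a toy in plain Python, not Spark's API: the class ToyCachedDataset, its methods, and the worker names "w1" and "w2" are all hypothetical, chosen only to illustrate how a cached partition lost with its worker can be rebuilt from the recorded transformation (the lineage) while surviving partitions are served from cache.

```python
from typing import Callable, Dict, List


class ToyCachedDataset:
    """Toy stand-in for a cached dataset. Hypothetical; not Spark's API."""

    def __init__(self, source: List[List[int]], transform: Callable[[int], int]):
        self.source = source                    # durable input, one list per partition
        self.transform = transform              # the "lineage": how to rebuild a partition
        self.cache: Dict[int, List[int]] = {}   # partition index -> cached rows
        self.location: Dict[int, str] = {}      # partition index -> worker holding the cache
        self.compute_count = 0                  # how many partitions were (re)computed

    def _compute(self, idx: int) -> List[int]:
        self.compute_count += 1
        return [self.transform(x) for x in self.source[idx]]

    def collect(self, workers: List[str]) -> List[int]:
        rows: List[int] = []
        for idx in range(len(self.source)):
            if idx not in self.cache:
                # Cache miss: rebuild from lineage and place on an available worker.
                self.cache[idx] = self._compute(idx)
                self.location[idx] = workers[idx % len(workers)]
            rows.extend(self.cache[idx])
        return rows

    def lose_worker(self, worker: str) -> None:
        # Drop every cached partition that lived on the failed worker.
        for idx in [i for i, w in self.location.items() if w == worker]:
            del self.cache[idx]
            del self.location[idx]


def demo() -> dict:
    ds = ToyCachedDataset([[1, 2], [3, 4], [5, 6]], lambda x: x * 10)
    first = ds.collect(["w1", "w2"])    # computes and caches all three partitions
    initial = ds.compute_count
    ds.lose_worker("w2")                # the partition cached on w2 is gone
    second = ds.collect(["w1"])         # only the lost partition is recomputed
    return {"first": first, "second": second,
            "initial": initial, "total": ds.compute_count}


if __name__ == "__main__":
    print(demo())
```

In this toy, the second collect returns the same rows as the first and recomputes only the partition that was cached on the lost worker. The model has no counterpart for the driver, which mirrors the point of option E: nothing in this recovery path can rebuild the process that coordinates it.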