Databricks Certified Associate Developer for Apache Spark — Question 188
How does the Directed Acyclic Graph (DAG) represent the execution hierarchy in Apache Spark™?
Answer options
- A. It represents the execution of multiple jobs, where each job runs independently of stages and tasks.
- B. It represents the execution of a job as a series of interdependent stages, each broken into parallel tasks.
- C. It represents the tasks executed by Spark as individual jobs, with each job containing multiple applications.
- D. It represents only the final tasks without considering stages or jobs.
Correct answer: B
Explanation
The correct answer is B. In Spark, an action triggers a job, and the DAG scheduler divides that job into stages at shuffle boundaries. Later stages depend on the output of earlier ones, and each stage is split into tasks, one per partition, that can run in parallel. Option A is incorrect because jobs do not run independently of stages and tasks; a job is made up of them. Option C inverts the hierarchy: tasks are the smallest unit of work, not jobs, and an application contains jobs, not the other way around. Option D is incorrect because the DAG captures the whole execution plan, including stages and their dependencies, not only the final tasks.
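The job, stage, and task hierarchy described in the explanation can be sketched with a small toy model in plain Python. This is only an illustration under simplifying assumptions, not Spark's scheduler: the `Stage` class and `plan_job` function are hypothetical names, the job is treated as a linear chain of operations (real DAGs can branch, for example at joins), each wide operation is placed at the start of the stage that follows the shuffle, and every stage is assumed to keep the same partition count.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Toy model only: Stage and plan_job are hypothetical and not part of any Spark API.


@dataclass
class Stage:
    stage_id: int
    operations: List[str]
    num_tasks: int  # assumption: one task per partition


def plan_job(operations: List[Tuple[str, bool]], num_partitions: int) -> List[Stage]:
    """Split a linear chain of operations into stages.

    Each operation is a (name, is_wide) pair. A wide operation needs a
    shuffle, so it closes the current stage and starts the next one.
    """
    stages: List[Stage] = []
    current: List[str] = []
    for name, is_wide in operations:
        if is_wide and current:
            # Shuffle boundary: finish the stage built so far.
            stages.append(Stage(len(stages), current, num_partitions))
            current = []
        current.append(name)
    if current:
        stages.append(Stage(len(stages), current, num_partitions))
    return stages


# One job: narrow operations are pipelined together, the wide one starts a new stage.
job = [
    ("read", False),
    ("filter", False),
    ("groupBy", True),
    ("map", False),
]
stages = plan_job(job, num_partitions=4)
for s in stages:
    print(f"stage {s.stage_id}: {s.operations} -> {s.num_tasks} parallel tasks")
```

In this sketch the single job yields two stages, the second of which cannot start until the first has finished, and each stage fans out into four tasks. That mirrors the structure in option B: one job, a series of interdependent stages, and parallel tasks within each stage.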