Databricks Certified Data Engineer Professional — Question 20
A production cluster has 3 executor nodes and uses the same virtual machine type for the driver and executor.
When evaluating the Ganglia Metrics for this cluster, which indicator would signal a bottleneck caused by code executing on the driver?
Answer options
- A. The Five Minute Load Average remains consistent/flat
- B. Bytes Received never exceeds 80 million bytes per second
- C. Total Disk Space remains constant
- D. Network I/O never spikes
- E. Overall cluster CPU utilization is around 25%
Correct answer: E
Explanation
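The correct answer is E. The cluster has four nodes of the same VM type: one driver and three executors. If work is running only on the driver (for example, single-threaded code or local processing of collected results), the driver's CPU is busy while the three executors sit largely idle, so the cluster-wide CPU average settles at roughly 1/4, or about 25%. The other options do not isolate the driver: a flat Five Minute Load Average, constant Total Disk Space, Network I/O without spikes, and Bytes Received staying under a fixed rate can all occur in a healthy, well-distributed workload, and none of them shows that work is concentrated on a single node.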
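The 25% figure comes from the cluster's shape: one driver plus three executors of the same VM type gives four equally sized nodes, so one fully busy node contributes about a quarter of the total CPU capacity. The sketch below illustrates that arithmetic. The function name and the per-node utilization values are hypothetical and chosen for illustration; they are not read from Ganglia.

```python
def cluster_cpu_utilization(per_node_utilization):
    """Average CPU utilization across equally sized nodes (fractions from 0.0 to 1.0)."""
    return sum(per_node_utilization) / len(per_node_utilization)


# Hypothetical snapshot: the driver is saturated and the 3 executors are idle.
driver_bound = [1.0, 0.0, 0.0, 0.0]  # [driver, executor 1, executor 2, executor 3]

print(f"{cluster_cpu_utilization(driver_bound):.0%}")  # prints 25%
```

In practice the executors are rarely at exactly 0%, so the observed value hovers around 25% rather than matching it exactly.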