Databricks Certified Data Engineer Professional — Question 99

Which indicators would you look for in the Spark UI’s Storage tab to signal that a cached table is not performing optimally? Assume you are using Spark’s MEMORY_ONLY storage level.

Answer options

Correct answer: C

Explanation

The correct answer is C, as a Size on Disk greater than zero indicates that the data is being cached but may not be fitting optimally in memory. Option A is incorrect because ideally, Size on Disk should not be less than Size in Memory for optimal performance. Options B, D, and E highlight other issues but do not directly indicate suboptimal performance related to caching in MEMORY_ONLY mode.