Databricks Certified Machine Learning Associate — Question 25

Which of the following describes the relationship between native Spark DataFrames and pandas API on Spark DataFrames?

Answer options

Correct answer: C

Explanation

The correct answer is C: a pandas API on Spark DataFrame is built on top of a Spark DataFrame and carries additional metadata, such as index information, that lets it support pandas-style operations. Options A and E mischaracterize this relationship. Option B is wrong because the pandas API on Spark runs on the same Spark engine and does not make DataFrames faster than native Spark DataFrames. Option D is misleading because it does not describe their mutability accurately.