Databricks Certified Associate Developer for Apache Spark — Question 221
Which of the following statements about Spark DataFrames is incorrect?
Answer options
- A. Spark DataFrames are the same as a data frame in Python or R.
- B. Spark DataFrames are built on top of RDDs.
- C. Spark DataFrames are immutable.
- D. Spark DataFrames are distributed.
- E. Spark DataFrames have common Structured APIs.
Correct answer: A
Explanation
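Option A is the incorrect statement, which makes it the correct answer. Spark DataFrames resemble data frames in Python (pandas) or R in concept, since both organize data into named columns, but they are not the same thing. A pandas or R data frame lives in the memory of a single machine, whereas a Spark DataFrame is partitioned across a cluster, evaluated lazily, and optimized by Spark's query optimizer. The remaining statements are true: Spark DataFrames are built on top of RDDs (B), are immutable (C), are distributed across the cluster (D), and share the common Structured APIs (E).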