Databricks Certified Associate Developer for Apache Spark — Question 160

The code block shown below contains an error. The code block is intended to create and register a SQL UDF named "ASSESS_PERFORMANCE" using the Python function assessPerformance() and apply it to column customerSatistfaction in table stores. Identify the error.

Code block:

spark.udf.register("ASSESS_PERFORMANCE", assessPerformance)
spark.sql("SELECT customerSatisfaction, assessPerformance(customerSatisfaction) AS result FROM stores")

Answer options

Correct answer: E

Explanation

The correct answer is E because the SQL function name in the query should match the registered UDF name 'ASSESS_PERFORMANCE'. The other options are incorrect because the use of sql() is valid, the order of arguments in spark.udf.register() is correct, columns can be referenced multiple times in SQL, and registered UDFs can indeed be used in SQL statements.