Databricks Certified Data Engineer Associate — Question 24

A data engineer has three tables in a Delta Live Tables (DLT) pipeline. They have configured the pipeline to drop invalid records at each table. They notice that some data is being dropped due to quality concerns at some point in the DLT pipeline. They would like to determine at which table in their pipeline the data is being dropped.
Which of the following approaches can the data engineer take to identify the table that is dropping the records?

Answer options

Correct answer: D

Explanation

The correct answer is D because navigating to the DLT pipeline page and reviewing the data quality statistics for each table allows the engineer to pinpoint where the records are being dropped. Option A is incorrect as establishing expectations does not directly indicate where the drops occur. Option B is wrong since it is possible to identify the table. Option C does not provide the needed insights about the specific table, and option E focuses on errors rather than data quality statistics.