Databricks Certified Data Engineer Professional — Question 61

A new data engineer notices that a critical field was omitted from an application that writes its Kafka source to Delta Lake. This happened even though the critical field was in the Kafka source. That field was further missing from data written to dependent, long-term storage. The retention threshold on the Kafka service is seven days. The pipeline has been in production for three months.
Which describes how Delta Lake can help to avoid data loss of this nature in the future?

Answer options

Correct answer: E

Explanation

The correct answer is E because ingesting all raw data and metadata into a bronze Delta table creates a comprehensive and enduring record of the data, which can be replayed to recover any lost information. Options A, B, C, and D are incorrect as they do not provide a solution to prevent data loss; they describe functionalities that do not address the underlying issue of missing data from the ingestion pipeline.