Databricks Certified Data Engineer Professional — Question 66

A data architect has heard about Delta Lake’s built-in versioning and time travel capabilities. For auditing purposes, they have a requirement to maintain a full record of all valid street addresses as they appear in the customers table.

The architect is interested in implementing a Type 1 table, overwriting existing records with new values and relying on Delta Lake time travel to support long-term auditing. A data engineer on the project feels that a Type 2 table will provide better performance and scalability.

Which piece of information is critical to this decision?

Answer options

Correct answer: D

Explanation

The correct answer is D because Delta Lake time travel can incur high costs and latency when scaling for long-term versioning, making it less suitable for Type 1 tables. Option A discusses data corruption, which is not directly relevant to the scalability issue, while option B mentions shallow clones, which do not address the core scalability concern. Option C inaccurately states that time travel cannot be used with Type 1 tables, which is incorrect as it can be used but may not be cost-effective at scale. Option E is incorrect as Delta Lake does allow updates to tables.