Databricks Certified Data Engineer Professional — Question 143
A junior data engineer is migrating a workload from a relational database system to the Databricks Lakehouse. The source system uses a star schema, leveraging foreign key constraints and multi-table inserts to validate records on write.
Which consideration will impact the decisions made by the engineer while migrating this workload?
Answer options
- A. Databricks only allows foreign key constraints on hashed identifiers, which avoid collisions in highly-parallel writes.
- B. Foreign keys must reference a primary key field; multi-table inserts must leverage Delta Lake’s upsert functionality.
- C. Committing to multiple tables simultaneously requires taking out multiple table locks and can lead to a state of deadlock.
- D. All Delta Lake transactions are ACID compliant against a single table, and Databricks does not enforce foreign key constraints.
Correct answer: D
Explanation
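The correct answer is D. Delta Lake guarantees ACID transactions only at the level of a single table, so a write that spans several tables is committed as separate, independent transactions rather than as one atomic unit. Databricks also does not enforce foreign key constraints; where primary and foreign keys can be declared, they are informational only. A workload that relied on the source database to reject invalid records on write must therefore perform that validation in the pipeline itself. Option A is incorrect because there is no foreign key support tied to hashed identifiers. Option B is incorrect because foreign keys are not enforced, and MERGE (upsert) targets a single table, so it does not provide multi-table inserts. Option C is incorrect because Delta Lake uses optimistic concurrency control rather than table locks and does not commit to multiple tables in a single transaction, so deadlock is not the concern.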
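Because foreign keys are not enforced on write, the referential check the source database performed has to be done by the pipeline before the fact table is written. The following is a minimal sketch of that idea in plain Python, not a Databricks API: the table names (dim_customer, fact_orders), the column names, and the helper split_by_reference are all hypothetical and chosen only for illustration.

```python
# Hypothetical star-schema data: one dimension table and one fact table,
# represented as lists of dicts so the sketch runs without a Spark cluster.
dim_customer = [
    {"customer_id": 1, "name": "Ada"},
    {"customer_id": 2, "name": "Grace"},
]

fact_orders = [
    {"order_id": 100, "customer_id": 1, "amount": 25.0},
    {"order_id": 101, "customer_id": 3, "amount": 40.0},  # no matching dimension row
    {"order_id": 102, "customer_id": 2, "amount": 15.5},
]


def split_by_reference(fact_rows, dim_rows, key):
    """Split fact rows into (valid, orphaned) by whether `key` exists in the dimension."""
    # Collect the set of keys the dimension table actually contains.
    known_keys = {row[key] for row in dim_rows}
    valid = [row for row in fact_rows if row[key] in known_keys]
    orphaned = [row for row in fact_rows if row[key] not in known_keys]
    return valid, orphaned


valid_orders, orphaned_orders = split_by_reference(fact_orders, dim_customer, "customer_id")
print("valid order ids:", [r["order_id"] for r in valid_orders])
print("orphaned order ids:", [r["order_id"] for r in orphaned_orders])
```

In a Spark pipeline the same check would typically be expressed as a left anti join between the fact and dimension DataFrames, with orphaned rows quarantined or rejected before the write. Since each table is still committed in its own transaction, the pipeline also has to tolerate a failure between the dimension write and the fact write.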