Databricks Certified Data Engineer Professional — Question 213

An analytics team wants run an experiment in the short term on the customer transaction Delta table (with 20 billions records) created by the data engineering team in Databricks SQL.

Which strategy should the data engineering team use to ensure minimal downtime and no impact on the ongoing ETL processes?

Answer options

Correct answer: C

Explanation

The correct answer is C because a shallow clone allows the analytics team to work with a copy of the table without duplicating the entire dataset, thus ensuring minimal downtime and no impact on ongoing ETL processes. Option A is incorrect as deep cloning would create a full copy, which could lead to downtime. Option B is not suitable because creating a new table might interfere with the existing ETL processes. Option D does not provide a separate workspace for the analytics team, which could affect their experimentation.