Databricks Certified Data Engineer Professional — Question 213
An analytics team wants to run a short-term experiment on the customer transaction Delta table (20 billion records), which the data engineering team created in Databricks SQL.
Which strategy should the data engineering team use to ensure minimal downtime and no impact on the ongoing ETL processes?
Answer options
- A. Deep clone the table for the analytics team.
- B. Create a new table for the analytics team using a CTAS statement.
- C. Shallow clone the table for the analytics team.
- D. Give access to the table for the analytics team.
Correct answer: C
Explanation
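The correct answer is C. A shallow clone copies only the table's metadata and references the source table's existing data files, so it is created quickly even for 20 billion records and needs no additional storage up front. Writes to the clone produce new files that belong only to the clone, so the source table and the ETL jobs that write to it are unaffected. Because the clone depends on the source's data files, it suits short-lived work such as this experiment. Option A is incorrect because a deep clone copies every data file, which is slow and costly at this scale and unnecessary for a temporary experiment. Option B is not suitable for the same reason: a CTAS statement reads and rewrites the full dataset, consuming compute and storage. Option D gives the analytics team no isolated copy, so any experimental writes would land on the production table and could disrupt the ETL processes.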
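As a rough illustration of option C, the sketch below builds the `CREATE TABLE ... SHALLOW CLONE` statement in Python. The helper `shallow_clone_sql` and both table names are hypothetical and used only for illustration; the question does not name the tables. The optional `VERSION AS OF` clause pins the clone to one version of the source table.

```python
from typing import Optional


def shallow_clone_sql(source: str, target: str, version: Optional[int] = None) -> str:
    """Build a Delta SHALLOW CLONE statement (hypothetical helper).

    source/target are fully qualified table names; version, if given,
    pins the clone to that version of the source table.
    """
    stmt = f"CREATE TABLE IF NOT EXISTS {target} SHALLOW CLONE {source}"
    if version is not None:
        stmt += f" VERSION AS OF {int(version)}"
    return stmt


# Hypothetical table names, not taken from the question.
stmt = shallow_clone_sql(
    source="prod.sales.customer_transactions",
    target="sandbox.analytics.customer_transactions_exp",
)
print(stmt)

# In a Databricks notebook, where a `spark` session is predefined,
# the statement could then be run with: spark.sql(stmt)
```

When the experiment ends, dropping the clone would leave the source table and its ETL pipeline untouched.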