Databricks Certified Data Engineer Professional — Question 217
A data engineer is configuring Delta Sharing for a Databricks-to-Databricks scenario to optimize read performance. The recipient needs to perform time travel queries and streaming reads on shared sales data.
Which configuration will provide the optimal performance while enabling these capabilities?
Answer options
- A. Use the open sharing protocol instead of Databricks-to Databricks sharing for better performance.
- B. Share tables WITHOUT HISTORY and enable partitioning for better query performance.
- C. Share tables WITH HISTORY, ensure tables don't have partitioning enabled. and enable CDF before sharing.
- D. Share the entire schema WITHOUT HISTORY and rely on recipient-side caching for performance.
Correct answer: C
Explanation
Option C is correct because sharing tables WITH HISTORY allows for time travel queries while ensuring that change data feed (CDF) is enabled, which is essential for streaming reads. Options A and B do not support the time travel requirement, and option D lacks the necessary history, making it unsuitable for the specified use case.