A large company seeks to implement a near real-time solution involving hundreds of pipeli…

Question

A large company seeks to implement a near real-time solution involving hundreds of pipelines with parallel updates of many tables with extremely high volume and high velocity data. Which of the following solutions would you implement to achieve this requirement?

Accepted Answer

Correct answer: B. B. Partition ingestion tables by a small time duration to allow for many data files to be written in parallel. — The correct answer is B because partitioning ingestion tables by a small time duration allows for parallel writing of many data files, which is essential for handling high volume and velocity data. Options A, C, D, and E do not specifically address the requirement for parallel updates of multiple tables and may not effectively optimize for the high data throughput needed in this scenario.

Databricks Certified Data Engineer Professional — Question 73

Answer options

Correct answer: B

Explanation