Databricks Certified Data Engineer Professional — Question 98
A Delta Lake table representing metadata about content posts from users has the following schema:
user_id LONG, post_text STRING, post_id STRING, longitude FLOAT, latitude FLOAT, post_time TIMESTAMP, date DATE
Based on the above schema, which column is a good candidate for partitioning the Delta Table?
Answer options
- A. post_time
- B. latitude
- C. post_id
- D. user_id
- E. date
Correct answer: E
Explanation
The 'date' column is the best partitioning candidate. It has low cardinality (one value per day), so each partition holds a reasonably large amount of data, and queries that filter on a time period can skip every partition outside that range. The other columns are poor choices: 'post_time', 'post_id', and 'user_id' are high-cardinality and would split the table into a very large number of tiny partitions, while 'latitude' is a continuous FLOAT with many distinct values.
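One way to see why 'date' works well is to count how many distinct values, and therefore how many partition directories, each candidate column would produce. The following is a minimal pure-Python sketch on synthetic data, not Spark or Delta Lake code: the helper names `make_posts` and `partition_counts`, the row counts, and the 30-day window are all assumptions chosen for illustration.

```python
from datetime import datetime, timedelta
import random


def make_posts(n_posts=10_000, n_days=30, n_users=2_000, seed=0):
    """Generate synthetic rows for the candidate columns (hypothetical data)."""
    rng = random.Random(seed)
    start = datetime(2024, 1, 1)
    posts = []
    for i in range(n_posts):
        # Random timestamp at one-second resolution inside the n_days window.
        post_time = start + timedelta(seconds=rng.randrange(n_days * 86_400))
        posts.append({
            "user_id": rng.randrange(n_users),
            "post_id": f"post-{i}",
            "latitude": rng.uniform(-90.0, 90.0),
            "post_time": post_time,
            "date": post_time.date(),  # derived from post_time, one value per day
        })
    return posts


def partition_counts(posts):
    """Count distinct values per column, i.e. the number of partitions
    the table would have if it were partitioned by that column."""
    columns = ["post_time", "latitude", "post_id", "user_id", "date"]
    return {col: len({row[col] for row in posts}) for col in columns}


counts = partition_counts(make_posts())
# Print the candidates from fewest to most partitions.
for col, n in sorted(counts.items(), key=lambda kv: kv[1]):
    print(f"{col:>10}: {n} partitions")
```

Under these assumptions 'date' yields at most one partition per day, while 'post_id' yields one partition per row. Real tables will differ in scale, but the ordering of the candidates is what the question is testing.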