Databricks Certified Data Engineer Professional — Question 43
Two of the most common data locations on Databricks are the DBFS root storage and external object storage mounted with dbutils.fs.mount().
Which of the following statements is correct?
Answer options
- A. DBFS is a file system protocol that allows users to interact with files stored in object storage using syntax and guarantees similar to Unix file systems.
- B. By default, both the DBFS root and mounted data sources are only accessible to workspace administrators.
- C. The DBFS root is the most secure location to store data, because mounted storage volumes must have full public read and write permissions.
- D. Neither the DBFS root nor mounted storage can be accessed when using %sh in a Databricks notebook.
- E. The DBFS root stores files in ephemeral block volumes attached to the driver, while mounted directories will always persist saved data to external storage between sessions.
Correct answer: A
Explanation
Option A is correct because DBFS indeed functions as a file system protocol that mimics Unix file system behaviors. Option B is incorrect since data sources are accessible to more than just administrators. Option C is wrong as the DBFS root's security does not stem from mounted storage permissions. Option D is false because both DBFS root and mounted storage can be accessed in %sh. Option E is misleading because DBFS root does not use ephemeral storage for its files.