Databricks Certified Data Analyst Associate — Question 37
A data team has been given a series of projects by a consultant that need to be implemented in the Databricks Lakehouse Platform.
Which of the following projects should be completed in Databricks SQL?
Answer options
- A. Testing the quality of data as it is imported from a source
- B. Tracking usage of feature variables for machine learning projects
- C. Combining two data sources into a single, comprehensive dataset
- D. Segmenting customers into like groups using a clustering algorithm
- E. Automating complex notebook-based workflows with multiple tasks
Correct answer: C
Explanation
The correct answer, C, is appropriate for Databricks SQL because it involves data manipulation and querying, which is a strength of SQL. The other options focus on tasks like data quality testing, feature tracking, customer segmentation, and automation, which may require different tools or environments better suited for those specific functionalities.