Databricks Certified Machine Learning Associate — Question 4
Which of the following statements describes a Spark ML estimator?
Answer options
- A. An estimator is a hyperparameter grid that can be used to train a model
- B. An estimator chains multiple algorithms together to specify an ML workflow
- C. An estimator is a trained ML model which turns a DataFrame with features into a DataFrame with predictions
- D. An estimator is an algorithm which can be fit on a DataFrame to produce a Transformer
- E. An estimator is an evaluation tool to assess to the quality of a model
Correct answer: D
Explanation
The correct answer is D because an estimator in Spark ML is indeed an algorithm that can be fitted to a DataFrame to create a Transformer. Options A, B, and C describe different components of the ML process but do not accurately define an estimator. Option E mischaracterizes the role of an estimator as an evaluation tool rather than a fitting algorithm.