CompTIA DataX (DY0-001) — Question 3

A data scientist is building an inferential model with a single predictor variable. A scatter plot of the independent variable against the real-number dependent variable shows a strong relationship between them. The predictor variable is normally distributed with very few outliers. Which of the following algorithms is the best fit for this model, given the data scientist wants the model to be easily interpreted?

Answer options

Correct answer: C

Explanation

The correct answer is C, a linear regression, as it provides a straightforward interpretation of the relationship between the predictor and dependent variable. The other options, such as logistic and probit regression, are more suited for binary outcomes, while exponential regression is not appropriate for normally distributed data with a strong linear relationship.