CompTIA DataX (DY0-001) — Question 2
A data scientist is performing a linear regression and wants to construct a model that explains the most variation in the data. Which of the following should the data scientist maximize when evaluating the regression performance metrics?
Answer options
- A. Accuracy
- B. R²
- C. p value
- D. AUC
Correct answer: B
Explanation
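The correct answer is R² (R-squared, the coefficient of determination), which represents the proportion of variance in the dependent variable that the model explains. A higher R² means the model accounts for more of the variation in the data. Accuracy and AUC are classification metrics, so they do not apply to a regression with a continuous target. A p value measures statistical significance rather than explained variance, and it is compared against a threshold, not maximized.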
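To make "proportion of variance explained" concrete, here is a minimal sketch of the usual definition, R² = 1 - SS_res / SS_tot, written in plain Python. The function name `r_squared` and the sample values are illustrative assumptions, not part of the exam material.

```python
def r_squared(y_true, y_pred):
    """Return R^2 = 1 - SS_res / SS_tot for paired observations and predictions."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and the same length")

    mean_y = sum(y_true) / len(y_true)

    # Residual sum of squares: variation the model fails to explain.
    ss_res = sum((yt - yp) ** 2 for yt, yp in zip(y_true, y_pred))

    # Total sum of squares: variation of the observations around their mean.
    ss_tot = sum((yt - mean_y) ** 2 for yt in y_true)

    if ss_tot == 0:
        raise ValueError("R^2 is undefined when y_true has zero variance")

    return 1 - ss_res / ss_tot


# Hypothetical data: predictions that track the observations closely.
observed = [1, 2, 3, 4]
predicted = [1.1, 1.9, 3.2, 3.8]
print(round(r_squared(observed, predicted), 4))  # roughly 0.98
```

A perfect fit gives 1.0 and a model that always predicts the mean of the observations gives 0.0, which is why a larger value corresponds to more explained variation.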