CompTIA Data+ (DA0-001) — Question 83
Which of the following tools would be best to use to calculate the interquartile range, median, mean, and standard deviation of a column in a table that has 5,000,000 rows?
Answer options
- A. Microsoft Excel
- B. R
- C. Snowflake
- D. SQL
Correct answer: B
Explanation
R is specifically designed for statistical analysis and can efficiently handle large datasets, making it the best option for calculating statistics such as interquartile range, median, mean, and standard deviation. While Microsoft Excel can perform some of these calculations, it is not optimal for very large datasets like 5,000,000 rows. Snowflake and SQL are more focused on data warehousing and querying, rather than advanced statistical computations.