CompTIA DataX (DY0-001) — Question 8

A data analyst wants to save a newly analyzed data set to a local storage option. The data set must meet the following requirements:

Be minimal in size -
Have the ability to be ingested quickly
Have the associated schema, including data types, stored with it
Which of the following file types is the best to use?

Answer options

Correct answer: B

Explanation

The best choice is Parquet because it is designed for efficient data storage and retrieval, offering excellent compression and support for schema evolution. While JSON and XML are more flexible, they do not provide the same level of compactness and performance for large datasets. CSV lacks the ability to store data types and schema information, making it less suitable for this scenario.