A company has several new datasets in CSV and JSON formats. A data engineer needs to make…

Question

A company has several new datasets in CSV and JSON formats. A data engineer needs to make the data available to a team of data analysts who will analyze the data by using SQL queries. Which solution will meet these requirements in the MOST cost-effective way?

Accepted Answer

Correct answer: C. Store the data in an Amazon S3 bucket. Use an AWS Glue crawler to catalog the S3 bucket as tables. Create an Amazon Athena workgroup that has a data usage threshold. Grant the data analysts access to the Athena workgroup.

Option C is correct because Amazon S3, an AWS Glue crawler, and Amazon Athena let the analysts run SQL queries directly against the CSV and JSON files without provisioning or managing dedicated database infrastructure. Athena is serverless and bills by the amount of data each query scans, and the data usage threshold on the workgroup helps keep that spend under control. Option A costs more because it requires running and managing an Amazon RDS instance. Option B does not provide direct SQL query capabilities. Option D incurs additional costs for Amazon QuickSight and SPICE.
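The setup described in option C can be sketched in Python. This is a minimal illustration, not a definitive implementation: it builds the request parameters for the boto3 calls `create_crawler`, `start_crawler`, and `create_work_group`, and it uses the workgroup's per-query scan limit (`BytesScannedCutoffPerQuery`) as one way to express a data usage threshold. The helper names and any bucket, role, database, or workgroup names passed in are hypothetical, and the sketch assumes the Glue database and the IAM role already exist.

```python
def crawler_request(name, role_arn, database, s3_path):
    """Build keyword arguments for the Glue create_crawler call."""
    return {
        "Name": name,
        "Role": role_arn,          # IAM role the crawler assumes (assumed to exist)
        "DatabaseName": database,  # Glue Data Catalog database (assumed to exist)
        "Targets": {"S3Targets": [{"Path": s3_path}]},
    }


def workgroup_request(name, results_location, max_bytes_per_query):
    """Build keyword arguments for the Athena create_work_group call."""
    return {
        "Name": name,
        "Configuration": {
            # Where Athena writes query results
            "ResultConfiguration": {"OutputLocation": results_location},
            # Make the workgroup settings override client-side settings
            "EnforceWorkGroupConfiguration": True,
            "PublishCloudWatchMetricsEnabled": True,
            # Cancel any query that scans more than this many bytes
            "BytesScannedCutoffPerQuery": max_bytes_per_query,
        },
    }


def provision(glue, athena, crawler_kwargs, workgroup_kwargs):
    """Create and start the crawler, then create the workgroup.

    `glue` and `athena` are client objects, for example
    boto3.client("glue") and boto3.client("athena").
    """
    glue.create_crawler(**crawler_kwargs)
    glue.start_crawler(Name=crawler_kwargs["Name"])
    athena.create_work_group(**workgroup_kwargs)
```

With real credentials, `provision(boto3.client("glue"), boto3.client("athena"), ...)` would issue the calls. Passing the clients in as arguments keeps the request-building logic testable without an AWS account. Granting the analysts access to the workgroup is handled separately through IAM policies and is not shown here.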

AWS Certified Data Engineer – Associate (DEA-C01) — Question 206

Answer options

Correct answer: C

Explanation