AWS Certified Machine Learning – Specialty — Question 54
A manufacturing company has structured and unstructured data stored in an Amazon S3 bucket. A Machine Learning Specialist wants to use SQL to run queries on this data.
Which solution requires the LEAST effort to be able to query this data?
Answer options
- A. Use AWS Data Pipeline to transform the data and Amazon RDS to run queries.
- B. Use AWS Glue to catalogue the data and Amazon Athena to run queries.
- C. Use AWS Batch to run ETL on the data and Amazon Aurora to run the queries.
- D. Use AWS Lambda to transform the data and Amazon Kinesis Data Analytics to run queries.
Correct answer: B
Explanation
Option B is correct because AWS Glue simplifies the process of cataloging data, making it readily accessible for querying with Amazon Athena, which supports SQL queries directly on data stored in S3. The other options involve more complex setups and additional services that require more effort to implement and maintain, making them less efficient for this use case.