AWS Certified Solutions Architect – Associate (SAA-C03) — Question 321

A company uses a legacy application to produce data in CSV format. The legacy application stores the output data in Amazon S3. The company is deploying a new commercial off-the-shelf (COTS) application that can perform complex SQL queries to analyze data that is stored in Amazon Redshift and Amazon S3 only. However, the COTS application cannot process the .csv files that the legacy application produces.

The company cannot update the legacy application to produce data in another format. The company needs to implement a solution so that the COTS application can use the data that the legacy application produces.

Which solution will meet these requirements with the LEAST operational overhead?

Answer options

Correct answer: A

Explanation

AWS Glue is a fully managed, serverless ETL service that requires minimal operational overhead compared to managing EC2 instances or EMR clusters, making Option A the most efficient choice. Storing the processed data in Amazon DynamoDB as proposed in Option C does not meet the requirements because the COTS application can only query Amazon Redshift and Amazon S3. Option B and D introduce unnecessary management and operational complexity compared to the serverless capabilities of AWS Glue.