AWS Certified Data Analytics – Specialty — Question 74

An online retail company is migrating its reporting system to AWS. The company's legacy system runs data processing on online transactions using a complex series of nested Apache Hive queries. Transactional data is exported from the online system to the reporting system several times a day. Schemas in the files are stable between updates.
A data analyst wants to quickly migrate the data processing to AWS, so any code changes should be minimized. To keep storage costs low, the data analyst decides to store the data in Amazon S3. It is vital that the data from the reports and associated analytics is completely up to date based on the data in Amazon S3.
Which solution meets these requirements?

Answer options

Correct answer: A

Explanation

Option A is the correct solution because it effectively uses AWS Glue to manage metadata and ensures that data changes are captured through a crawler that runs when data is refreshed. This approach minimizes code changes while maintaining the integrity of data processing. The other options either introduce unnecessary complexity or do not ensure the data is completely up to date with the latest transactions.