AWS Certified Data Analytics – Specialty — Question 5

A manufacturing company has been collecting IoT sensor data from devices on its factory floor for a year and is storing the data in Amazon Redshift for daily analysis. A data analyst has determined that, at an expected ingestion rate of about 2 TB per day, the cluster will be undersized in less than 4 months. A long-term solution is needed. The data analyst has indicated that most queries only reference the most recent 13 months of data, yet there are also quarterly reports that need to query all the data generated from the past 7 years. The chief technology officer (CTO) is concerned about the costs, administrative effort, and performance of a long-term solution.
Which solution should the data analyst use to meet these requirements?

Answer options

Correct answer: A

Explanation

Option A is the correct choice because it efficiently manages data by unloading older records to S3 while keeping the recent data in Redshift for performance. Option B does not address the data retention and query performance concerns. Option C fails to provide a mechanism for managing the older data in a cost-effective manner. Option D, while comprehensive, introduces unnecessary complexity and cost without utilizing the existing Redshift capabilities effectively.