AWS Certified Solutions Architect – Professional — Question 423

Your department creates regular analytics reports from your company's log files All log data is collected in Amazon S3 and processed by daily Amazon Elastic
MapReduce (EMR) jobs that generate daily PDF reports and aggregated tables in CSV format for an Amazon Redshift data warehouse.
Your CFO requests that you optimize the cost structure for this system.
Which of the following alternatives will lower costs without compromising average performance of the system or data integrity for the raw data?

Answer options

Correct answer: C

Explanation

To maintain the data integrity of the critical raw logs, they must remain on standard Amazon S3 storage rather than reduced redundancy storage (RRS), whereas reproducible outputs like PDFs and CSVs can safely use RRS to save costs. Amazon Redshift does not support Spot Instances, making Reserved Instances the correct choice for cost-effective, steady-state data warehousing. Adding Spot Instances to Amazon EMR task nodes successfully lowers data processing costs without compromising the overall average performance of the system.