AWS Certified Solutions Architect – Professional (SAP-C02) — Question 145

A financial services company receives a regular data feed from its credit card servicing partner. Approximately 5,000 records are sent every 15 minutes in plaintext, delivered over HTTPS directly into an Amazon S3 bucket with server-side encryption. This feed contains sensitive credit card primary account number (PAN) data. The company needs to automatically mask the PAN before sending the data to another S3 bucket for additional internal processing. The company also needs to remove and merge specific fields, and then transform the record into JSON format. Additionally, extra feeds are likely to be added in the future, so any design needs to be easily expandable.

Which solution will meet these requirements?

Answer options

Correct answer: C

Explanation

The correct answer is C because it uses AWS Glue for the ETL processing. A Glue ETL job can mask the PAN, remove and merge fields, and write the transformed records in JSON format to the second S3 bucket, and further feeds can be accommodated by adding jobs rather than redesigning the pipeline. Options A and B rely on AWS Lambda and Amazon SQS, which are likely to need more custom processing code and may be less efficient for larger datasets or more complex transformations. Option D adds Amazon Athena and Amazon EMR, which introduce unnecessary complexity for this straightforward ETL task.
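To make the required transformation concrete, the following is a minimal sketch in plain Python of the per-record logic an ETL job would apply: mask the PAN, merge and remove fields, and emit JSON. It is not AWS Glue code and not part of the answer option. The CSV layout, the field names (pan, first_name, last_name, amount, internal_ref), and the rule of keeping only the last four digits are all assumptions made for illustration.

```python
import csv
import io
import json


def mask_pan(pan: str) -> str:
    """Replace every digit except the last four with '*'."""
    digits = pan.replace(" ", "").replace("-", "")
    return "*" * max(len(digits) - 4, 0) + digits[-4:]


def transform_record(row: dict) -> dict:
    """Mask the PAN, merge the name fields, and drop unneeded fields."""
    return {
        "pan": mask_pan(row["pan"]),
        # Hypothetical merge: two name columns become one field.
        "cardholder": f"{row['first_name']} {row['last_name']}",
        "amount": row["amount"],
        # The hypothetical "internal_ref" column is deliberately omitted.
    }


def transform_feed(csv_text: str) -> str:
    """Convert a CSV feed to JSON Lines, one transformed record per line."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return "\n".join(json.dumps(transform_record(row)) for row in reader)


# Assumed sample feed; the real feed format is not given in the question.
sample = (
    "pan,first_name,last_name,amount,internal_ref\n"
    "4111 1111 1111 1111,Ana,Diaz,25.00,x-17\n"
)

if __name__ == "__main__":
    print(transform_feed(sample))
```

In a Glue-based design, logic of this kind would live inside the ETL job script, with the job reading from the source bucket and writing JSON to the destination bucket.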