AWS Certified Data Engineer – Associate (DEA-C01) — Question 187

An online retailer uses multiple delivery partners to deliver products to customers. The delivery partners send order summaries to the retailer. The retailer stores the order summaries in Amazon S3.

Some of the order summaries contain personally identifiable information (PII) about customers. A data engineer needs to detect PII in the order summaries so the company can redact the PII.

Which solution will meet these requirements with the LEAST operational overhead?

Answer options

Correct answer: C

Explanation

Amazon Macie is a managed service that uses machine learning and pattern matching to discover sensitive data, including PII, in objects stored in Amazon S3. Because the order summaries already reside in S3, Macie can scan them without custom code or additional infrastructure, which makes it the best fit with the least operational overhead. The other services do not detect PII directly: Amazon Textract extracts text from documents, Amazon S3 Storage Lens reports on storage usage and activity, and Amazon SageMaker Data Wrangler is used for data preparation and analysis.
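
To make the Macie approach concrete, the following is a minimal sketch of how a one-time sensitive data discovery job might be started with boto3. It assumes Macie is already enabled in the account and that credentials and a Region are configured. The account ID, bucket name, job name, and helper function names are hypothetical placeholders, not part of the original question.

```python
import uuid


def build_macie_job_request(account_id, bucket_name, job_name="order-summary-pii-scan"):
    """Build keyword arguments for Macie's CreateClassificationJob API.

    account_id, bucket_name, and job_name are hypothetical placeholders.
    """
    return {
        "clientToken": str(uuid.uuid4()),        # idempotency token
        "jobType": "ONE_TIME",                   # run once rather than on a schedule
        "name": job_name,
        "managedDataIdentifierSelector": "ALL",  # use all managed data identifiers
        "s3JobDefinition": {
            "bucketDefinitions": [
                {"accountId": account_id, "buckets": [bucket_name]}
            ]
        },
    }


def start_pii_scan(account_id, bucket_name):
    """Start the job and return its ID. Requires boto3 and AWS credentials."""
    import boto3  # imported lazily so the request builder works without boto3

    macie = boto3.client("macie2")
    response = macie.create_classification_job(
        **build_macie_job_request(account_id, bucket_name)
    )
    return response["jobId"]


if __name__ == "__main__":
    # Only builds and prints the request; no AWS call is made here.
    request = build_macie_job_request("111122223333", "example-order-summaries")
    print(request["jobType"], request["s3JobDefinition"]["bucketDefinitions"][0]["buckets"])
```

Calling start_pii_scan with a real account ID and bucket would submit the job. The resulting findings identify which objects contain PII, so the company can then redact it in a separate step.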