AWS Certified Data Engineer – Associate (DEA-C01) — Question 236

A data engineer is building a solution to detect sensitive information that is stored in a data lake across multiple Amazon S3 buckets. The solution must detect personally identifiable information (PII) that is in a proprietary data format.

Which solution will meet these requirements with the LEAST operational overhead?

Answer options

Correct answer: A

Explanation

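The correct answer is A. The Detect PII transform in AWS Glue is a managed feature that scans data for sensitive information, so there is no detection code or infrastructure to build and maintain. Besides its predefined patterns for common PII types, AWS Glue supports custom detection patterns defined as regular expressions, which is what makes it suitable for PII stored in a proprietary format. Options B, C, and D either add operational complexity or do not address detection of PII in proprietary formats as directly as option A.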
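
To illustrate what pattern-based PII detection does, here is a minimal plain-Python sketch. It is not the AWS Glue API and does not reproduce how the Detect PII transform is implemented. The `EMP-` identifier format, the pattern names, the `detect_pii` helper, and the sample record are all hypothetical, invented for this example.

```python
import re

# Hypothetical proprietary identifier format, invented for this illustration:
# "EMP-" followed by two uppercase letters and six digits, e.g. EMP-AB123456.
PATTERNS = {
    "EMPLOYEE_ID": re.compile(r"\bEMP-[A-Z]{2}\d{6}\b"),
    # Simplified stand-in for a predefined pattern (US SSN-like layout).
    "SSN_LIKE": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def detect_pii(record):
    """Return {column: [entity names]} for columns whose value matches a pattern."""
    findings = {}
    for column, value in record.items():
        matched = [name for name, pattern in PATTERNS.items()
                   if pattern.search(str(value))]
        if matched:
            findings[column] = matched
    return findings


# Hypothetical sample record: two columns carry PII, one does not.
sample = {"id": "EMP-AB123456", "note": "SSN on file: 123-45-6789", "city": "Lisbon"}
print(detect_pii(sample))
```

In this sketch the custom `EMPLOYEE_ID` pattern plays the role of a custom detection pattern, while `SSN_LIKE` stands in for a predefined one. A managed service takes on the work this sketch omits, such as reading from multiple S3 buckets, scaling, and scheduling, which is where the operational overhead is saved.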