AWS Certified Data Engineer – Associate (DEA-C01) — Question 236
A data engineer is building a solution to detect sensitive information that is stored in a data lake across multiple Amazon S3 buckets. The solution must detect personally identifiable information (PII) that is in a proprietary data format.
Which solution will meet these requirements with the LEAST operational overhead?
Answer options
- A. Use the AWS Glue Detect PII transform with specific patterns.
- B. Use Amazon Made with managed data identifiers.
- C. Use an AWS Lambda function with custom regular expressions.
- D. Use Amazon Athena with a SQL query to match the custom formats.
Correct answer: A
Explanation
The correct answer is A because AWS Glue's Detect PII transform is specifically designed to identify sensitive information with minimal operational overhead, utilizing predefined patterns. Options B, C, and D either involve more complexity or do not directly address the requirement for detecting PII in proprietary formats as effectively as option A.