AWS Certified Data Engineer – Associate (DEA-C01) — Question 134
A data engineer has implemented data quality rules in 1,000 AWS Glue Data Catalog tables. Because of a recent change in business requirements, the data engineer must edit the data quality rules.
How should the data engineer meet this requirement with the LEAST operational overhead?
Answer options
- A. Create a pipeline in AWS Glue ETL to edit the rules for each of the 1,000 Data Catalog tables. Use an AWS Lambda function to call the corresponding AWS Glue job for each Data Catalog table.
- B. Create an AWS Lambda function that makes an API call to AWS Glue Data Quality to make the edits.
- C. Create an Amazon EMR cluster. Run a pipeline on Amazon EMR that edits the rules for each Data Catalog table. Use an AWS Lambda function to run the EMR pipeline.
- D. Use the AWS Management Console to edit the rules within the Data Catalog.
Correct answer: B
Explanation
The correct answer is B because using an AWS Lambda function to make an API call to AWS Glue Data Quality allows for a streamlined and automated approach to editing the rules without needing to manually update each table or manage additional infrastructure. The other options introduce unnecessary complexity or operational overhead, such as creating pipelines or clusters, which are not required for this task.