AWS Certified Data Engineer – Associate (DEA-C01) — Question 125

A company reads data from customer databases that run on Amazon RDS. The databases contain many inconsistent fields. For example, a customer record field that iPnamed place_id in one database is named location_id in another database. The company needs to link customer records across different databases, even when customer record fields do not match.

Which solution will meet these requirements with the LEAST operational overhead?

Answer options

Correct answer: B

Explanation

Option B is the best choice as it utilizes AWS Glue, which automates the process of data discovery and transformation with minimal operational overhead. The other options involve more complex setups and maintenance, such as managing an EMR cluster or building custom pipelines, which increases operational workload.