AWS Certified Data Engineer – Associate (DEA-C01) — Question 179

A company stores CSV files in an Amazon S3 bucket. A data engineer needs to process the data in the CSV files and store the processed data in a new S3 bucket.

The process needs to rename a column, remove specific columns, ignore the second row of each file, create a new column based on the values of the first row of the data, and filter the results by a numeric value of a column.

Which solution will meet these requirements with the LEAST development effort?

Answer options

Correct answer: D

Explanation

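AWS Glue DataBrew is a visual data preparation service that applies transformations through recipe steps rather than custom code. Its built-in steps cover the requirements here: renaming a column, removing columns, skipping a row, deriving a new column, and filtering rows by a numeric column value. The other approaches, such as AWS Glue Python jobs, would require writing and maintaining code to achieve the same results, and crawlers only catalog data rather than transform it.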
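
To give a sense of the development effort a hand-coded approach involves, here is a minimal sketch of the five transformations in plain Python using only the standard library. It is not DataBrew code and not part of the question. The column names (id, amt, region, notes), the threshold of 100, and the function name process_csv are hypothetical. The sketch also assumes that "the second row" means the second line of the file (directly after the header) and that the new column copies one value from the first remaining data row. Reading from and writing to S3 is left out.

```python
import csv
import io

# Hypothetical sample file: header, a units row to ignore, then data rows.
SAMPLE = """id,amt,region,notes
units,USD,,
1,250,EU,a
2,50,EU,b
3,120,US,c
"""


def process_csv(text: str, threshold: float = 100.0) -> str:
    """Apply the five transformations from the question to one CSV file."""
    rows = list(csv.reader(io.StringIO(text)))
    if len(rows) < 3:
        return ""  # nothing left once the header and second row are excluded

    # Ignore the second row of the file (assumed to be the line after the header).
    header, data = rows[0], rows[2:]
    records = [dict(zip(header, row)) for row in data]

    # New column based on the first data row (assumed: reuse its "region" value).
    batch_region = records[0]["region"]

    out = io.StringIO()
    # Listing only the wanted fields removes "region" and "notes";
    # writing "amt" under the name "total" renames the column.
    writer = csv.DictWriter(
        out, fieldnames=["id", "total", "batch_region"], lineterminator="\n"
    )
    writer.writeheader()
    for rec in records:
        # Filter rows by the numeric value of a column.
        if float(rec["amt"]) > threshold:
            writer.writerow(
                {"id": rec["id"], "total": rec["amt"], "batch_region": batch_region}
            )
    return out.getvalue()


if __name__ == "__main__":
    print(process_csv(SAMPLE), end="")
```

Even this small version needs decisions about malformed rows, type conversion, and S3 input and output that a DataBrew recipe would handle through configuration, which is the effort the question asks to minimize.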