A company stores CSV files in an Amazon S3 bucket. A data engineer needs to process the d…

Question

A company stores CSV files in an Amazon S3 bucket. A data engineer needs to process the data in the CSV files and store the processed data in a new S3 bucket. The process needs to rename a column, remove specific columns, ignore the second row of each file, create a new column based on the values of the first row of the data, and filter the results by a numeric value of a column. Which solution will meet these requirements with the LEAST development effort?

Accepted Answer

Correct answer: D. Use AWS Glue DataBrew recipes to read and transform the CSV files. AWS Glue DataBrew is a visual data preparation service whose recipes are built from prebuilt, no-code transformation steps, covering tasks such as renaming columns, deleting columns, creating derived columns, and filtering rows by a numeric value. That makes it the option with the least development effort for the requirements listed. The other options, such as AWS Glue Python jobs or custom crawlers, would require writing, testing, and maintaining code to achieve the same results.
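
To make the development-effort comparison concrete, here is a minimal sketch of what hand-coding the same five transformations might look like in plain Python. The function name `transform_csv`, the column names (`cust_id`, `amount`, `region`, `notes`), and the sample data are hypothetical; the question does not name any of them. The sketch also assumes that "the second row" means the line directly after the header, and that the new column copies one value from the first data row. Reading from and writing to S3 is left out.

```python
import csv


def transform_csv(text, threshold):
    """Apply the five transformations from the question to one CSV document."""
    lines = text.splitlines()

    # Ignore the second row of the file
    # (assumption: the physical line right after the header).
    kept = lines[:1] + lines[2:]
    rows = list(csv.DictReader(kept))
    if not rows:
        return []

    # New column based on the first data row
    # (assumption: copy its "region" value to every output row).
    batch_region = rows[0]["region"]

    result = []
    for row in rows:
        # Filter the results by a numeric column value.
        if float(row["amount"]) < threshold:
            continue
        result.append({
            "customer_id": row["cust_id"],  # rename cust_id -> customer_id
            "amount": row["amount"],
            "batch_region": batch_region,
            # "region" and "notes" are removed by not copying them
        })
    return result


# Hypothetical sample file: header, a units row to skip, then data.
sample = """cust_id,amount,region,notes
units,USD,n/a,n/a
a1,50,eu,x
a2,5,us,y
a3,75,us,z
"""

for record in transform_csv(sample, threshold=10):
    print(record)
```

Even this small version would still need S3 access, error handling, packaging, and a way to run it on a schedule, which is the extra effort the explanation refers to. In DataBrew, the equivalent logic is configured as recipe steps rather than written as code.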

AWS Certified Data Engineer – Associate (DEA-C01) — Question 179
