AWS Certified Data Engineer – Associate (DEA-C01) — Question 179
A company stores CSV files in an Amazon S3 bucket. A data engineer needs to process the data in the CSV files and store the processed data in a new S3 bucket.
The process needs to rename a column, remove specific columns, ignore the second row of each file, create a new column based on the values of the first row of the data, and filter the results by a numeric value of a column.
Which solution will meet these requirements with the LEAST development effort?
Answer options
- A. Use AWS Glue Python jobs to read and transform the CSV files.
- B. Use an AWS Glue custom crawler to read and transform the CSV files.
- C. Use an AWS Glue workflow to build a set of jobs to crawl and transform the CSV files.
- D. Use AWS Glue DataBrew recipes to read and transform the CSV files.
Correct answer: D
Explanation
AWS Glue DataBrew is designed for data preparation tasks with minimal coding, making it ideal for the requirements of renaming, removing, and filtering columns. The other options, such as AWS Glue Python jobs or custom crawlers, would involve more complex coding and development effort to achieve the same results.