Question

A data engineer is processing and analyzing multiple terabytes of raw data that is in Amazon S3. The data engineer needs to clean and prepare the data. Then the data engineer needs to load the data into Amazon Redshift for analytics. The data engineer needs a solution that will give data analysts the ability to perform complex queries. The solution must eliminate the need to perform complex extract, transform, and load (ETL) processes or to manage infrastructure. Which solution will meet these requirements with the LEAST operational overhead?

Accepted Answer

Correct answer: B. Use AWS Glue DataBrew to prepare the data. Use AWS Glue to load the data into Amazon Redshift. Use Amazon Redshift to run queries.

AWS Glue DataBrew handles data preparation through a low-code visual interface, and AWS Glue loads the prepared data into Amazon Redshift. Both services are serverless, so there is no infrastructure to manage, and analysts can run their complex queries directly in Amazon Redshift. The other options either involve more complex setups or do not offer the same ease of use and low operational overhead.
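As a rough illustration of how the prepare-then-load flow in option B could be driven from code, here is a minimal Python sketch. It assumes a DataBrew recipe job and a Glue job that writes to Amazon Redshift already exist; the job names are hypothetical placeholders. The `databrew` and `glue` arguments are expected to behave like the boto3 clients returned by `boto3.client("databrew")` and `boto3.client("glue")`. This is one possible way to sequence the two jobs, not something the question prescribes.

```python
import time
from typing import Any, Dict


def run_prepare_then_load(
    databrew: Any,
    glue: Any,
    recipe_job: str,
    load_job: str,
    poll_seconds: float = 30.0,
) -> Dict[str, str]:
    """Run a DataBrew recipe job, wait for it, then start a Glue load job.

    recipe_job and load_job are hypothetical names of jobs that are
    assumed to have been created beforehand.
    """
    # Step 1: clean and prepare the raw S3 data with the DataBrew recipe job.
    databrew_run_id = databrew.start_job_run(Name=recipe_job)["RunId"]

    # Poll until the preparation run reaches a terminal state.
    while True:
        state = databrew.describe_job_run(
            Name=recipe_job, RunId=databrew_run_id
        )["State"]
        if state == "SUCCEEDED":
            break
        if state in ("FAILED", "STOPPED", "TIMEOUT"):
            raise RuntimeError(f"DataBrew job {recipe_job} ended in state {state}")
        time.sleep(poll_seconds)

    # Step 2: start the Glue job that loads the prepared data into Redshift.
    glue_run_id = glue.start_job_run(JobName=load_job)["JobRunId"]

    return {"databrew_run_id": databrew_run_id, "glue_run_id": glue_run_id}
```

In real use the call might look like `run_prepare_then_load(boto3.client("databrew"), boto3.client("glue"), "clean-raw-data", "load-to-redshift")`, where both job names are made up for this example. Passing the clients in as arguments keeps the function easy to exercise with stand-in objects.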

AWS Certified Data Engineer – Associate (DEA-C01) — Question 124

Explanation