A data engineer in a manufacturing company is designing a data processing platform that r…

Question

A data engineer in a manufacturing company is designing a data processing platform that receives a large volume of unstructured data. The data engineer must populate a well-structured star schema in Amazon
Redshift.
What is the most efficient architecture strategy for this purpose?

Accepted Answer

Correct answer: A. A. Transform the unstructured data using Amazon EMR and generate CSV data. COPY the CSV data into the analysis schema within Redshift. — Option A is the most efficient because using Amazon EMR to transform unstructured data into a structured CSV format allows for a smooth integration into Redshift using the COPY command, which is optimized for bulk loading. The other options involve more complex processes or real-time transformations, which can add latency and processing overhead, making them less efficient for this scenario.

AWS Certified Big Data – Specialty — Question 47

Answer options

Correct answer: A

Explanation