AWS Certified Data Engineer – Associate (DEA-C01) — Question 182

A company has AWS resources in multiple AWS Regions. The company has an Amazon EFS file system in each Region where the company operates. The company’s data science team operates within only a single Region. The data that the data science team works with must remain within the team's Region.

A data engineer needs to create a single dataset by processing files that are in each of the company's Regional EFS file systems. The data engineer wants to use an AWS Step Functions state machine to orchestrate AWS Lambda functions to process the data.

Which solution will meet these requirements with the LEAST effort?

Answer options

Correct answer: D

Explanation

Option D is correct. It keeps the data within the data science team's Region and minimizes effort because AWS DataSync automates the file transfer. Options A and B add unnecessary complexity: A requires VPC peering, and B requires data replication. Option C deploys Lambda functions in multiple Regions, which increases management overhead and violates the requirement to keep the data in the team's Region.
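
As a rough illustration of the orchestration the question describes, the sketch below builds a Step Functions state machine definition (Amazon States Language) as a Python dict. It assumes the files have already been transferred into the EFS file system in the data science team's Region, and that two Lambda functions exist in that Region: one that processes a single file and one that merges the results. The function name `build_state_machine_definition`, the state names, the `files` input key, and the example ARNs are all hypothetical and do not come from the question.

```python
import json


def build_state_machine_definition(process_fn_arn: str, merge_fn_arn: str) -> dict:
    """Return an Amazon States Language definition as a plain dict.

    Assumed input shape (hypothetical): {"files": [{"path": "..."}, ...]}
    where each path points at a file on the EFS mount in the team's Region.
    """
    return {
        "Comment": "Process transferred EFS files, then merge into one dataset",
        "StartAt": "ProcessFiles",
        "States": {
            # Fan out: run the processing Lambda once per entry in $.files.
            "ProcessFiles": {
                "Type": "Map",
                "ItemsPath": "$.files",
                "MaxConcurrency": 5,
                "ItemProcessor": {
                    "ProcessorConfig": {"Mode": "INLINE"},
                    "StartAt": "ProcessOneFile",
                    "States": {
                        "ProcessOneFile": {
                            "Type": "Task",
                            "Resource": "arn:aws:states:::lambda:invoke",
                            "Parameters": {
                                "FunctionName": process_fn_arn,
                                "Payload.$": "$",
                            },
                            # Keep only the Lambda's return value.
                            "OutputPath": "$.Payload",
                            "End": True,
                        }
                    },
                },
                "ResultPath": "$.processed",
                "Next": "MergeResults",
            },
            # Fan in: hand every per-file result to the merging Lambda.
            "MergeResults": {
                "Type": "Task",
                "Resource": "arn:aws:states:::lambda:invoke",
                "Parameters": {
                    "FunctionName": merge_fn_arn,
                    "Payload.$": "$.processed",
                },
                "OutputPath": "$.Payload",
                "End": True,
            },
        },
    }


if __name__ == "__main__":
    # Placeholder ARNs for illustration only.
    definition = build_state_machine_definition(
        "arn:aws:lambda:us-east-1:111122223333:function:process-file",
        "arn:aws:lambda:us-east-1:111122223333:function:merge-results",
    )
    print(json.dumps(definition, indent=2))
```

The printed JSON could be supplied as the definition when creating a state machine. Because both Lambda functions run in the team's Region and read from a file system in that same Region, the processing step itself does not move data across Regions.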