AWS Certified Data Engineer – Associate (DEA-C01) — Question 27

A company uses AWS Step Functions to orchestrate a data pipeline. The pipeline consists of Amazon EMR jobs that ingest data from data sources and store the data in an Amazon S3 bucket. The pipeline also includes EMR jobs that load the data to Amazon Redshift.
The company's cloud infrastructure team manually built a Step Functions state machine. The cloud infrastructure team launched an EMR cluster into a VPC to support the EMR jobs. However, the deployed Step Functions state machine is not able to run the EMR jobs.
Which combination of steps should the company take to identify the reason the Step Functions state machine is not able to run the EMR jobs? (Choose two.)

Answer options

Correct answer: B, D

Explanation

Option B is correct because it verifies that the Step Functions state machine has the IAM permissions it needs to run the EMR jobs and to access Amazon S3. If any of those permissions is missing, the state machine cannot start the jobs at all. Option D is correct because it checks the connectivity and security settings between the EMR cluster, which runs inside a VPC, and the data sources. If that traffic is blocked, the ingestion jobs cannot reach the data they need and will fail. The other options do not help identify a permission or connectivity problem, and those are the most likely causes of the failure.
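The two checks above can be sketched offline in Python. This is a minimal illustration, not the exam's reference solution. It assumes the IAM policy document attached to the state machine's role and the egress rules of the EMR cluster's security group have already been exported as JSON. The function names missing_actions and egress_allows are hypothetical. The action list assumes the state machine submits steps to an existing cluster, so adjust it to the integrations actually in use. Both checks are deliberately simplified: the policy check ignores Deny statements, Resource scoping, and Condition keys, and the network check looks only at IPv4 CIDR ranges.

```python
import fnmatch
import ipaddress

# Assumed set of actions for submitting steps to an existing EMR cluster;
# adjust to the service integrations the state machine actually uses.
REQUIRED_ACTIONS = [
    "elasticmapreduce:AddJobFlowSteps",
    "elasticmapreduce:DescribeStep",
    "elasticmapreduce:CancelSteps",
]


def missing_actions(policy_document, required_actions):
    """Return the required actions that no Allow statement matches.

    Simplified: ignores Deny statements, Resource scoping, and Condition keys.
    """
    statements = policy_document.get("Statement", [])
    if isinstance(statements, dict):  # a single statement may be a bare object
        statements = [statements]
    patterns = []
    for statement in statements:
        if statement.get("Effect") != "Allow":
            continue
        actions = statement.get("Action", [])
        if isinstance(actions, str):
            actions = [actions]
        # IAM action names are case-insensitive, so compare in lowercase.
        patterns.extend(action.lower() for action in actions)
    return [
        action
        for action in required_actions
        if not any(fnmatch.fnmatchcase(action.lower(), p) for p in patterns)
    ]


def egress_allows(egress_rules, dest_ip, dest_port):
    """Return True if any egress rule permits TCP traffic to dest_ip:dest_port.

    Rules use the shape of IpPermissionsEgress in the EC2
    DescribeSecurityGroups response. Simplified: prefix lists and
    referenced security groups are not evaluated.
    """
    address = ipaddress.ip_address(dest_ip)
    for rule in egress_rules:
        protocol = rule.get("IpProtocol")
        if protocol not in ("-1", "tcp"):  # "-1" means all protocols
            continue
        if protocol == "tcp" and not (
            rule["FromPort"] <= dest_port <= rule["ToPort"]
        ):
            continue
        for ip_range in rule.get("IpRanges", []):
            network = ipaddress.ip_network(ip_range["CidrIp"], strict=False)
            if address in network:
                return True
    return False


if __name__ == "__main__":
    # Hypothetical sample data for illustration only.
    policy = {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Effect": "Allow",
                "Action": ["elasticmapreduce:Describe*"],
                "Resource": "*",
            }
        ],
    }
    rules = [
        {
            "IpProtocol": "tcp",
            "FromPort": 5432,
            "ToPort": 5432,
            "IpRanges": [{"CidrIp": "10.0.0.0/16"}],
        }
    ]
    print("Missing actions:", missing_actions(policy, REQUIRED_ACTIONS))
    print("Egress to 10.0.3.7:5432 allowed:", egress_allows(rules, "10.0.3.7", 5432))
```

With the sample data, the policy grants only Describe* actions, so the sketch reports the two step-submission actions as missing, and the security group permits the sample data-source address on port 5432. A non-empty list from the first check, or False from the second, points to the kind of permission or connectivity gap that options B and D are meant to uncover.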