AWS Certified Data Engineer – Associate (DEA-C01) — Question 40
A data engineer must orchestrate a data pipeline that consists of one AWS Lambda function and one AWS Glue job. The solution must integrate with AWS services.
Which solution will meet these requirements with the LEAST management overhead?
Answer options
- A. Use an AWS Step Functions workflow that includes a state machine. Configure the state machine to run the Lambda function and then the AWS Glue job.
- B. Use an Apache Airflow workflow that is deployed on an Amazon EC2 instance. Define a directed acyclic graph (DAG) in which the first task is to call the Lambda function and the second task is to call the AWS Glue job.
- C. Use an AWS Glue workflow to run the Lambda function and then the AWS Glue job.
- D. Use an Apache Airflow workflow that is deployed on Amazon Elastic Kubernetes Service (Amazon EKS). Define a directed acyclic graph (DAG) in which the first task is to call the Lambda function and the second task is to call the AWS Glue job.
Correct answer: A
Explanation
Option A is the correct answer because AWS Step Functions provide a fully managed service that simplifies orchestration with minimal management overhead. Options B and D involve deploying Apache Airflow, which requires more management and infrastructure setup. Option C, while it uses AWS Glue, does not specifically orchestrate the Lambda function effectively.