Google Cloud Associate Data Practitioner — Question 84
Following a recent company acquisition, you inherited an on-premises data infrastructure that needs to move to Google Cloud. The acquired system has 250 Apache Airflow directed acyclic graphs (DAGs) orchestrating data pipelines. You need to migrate the pipelines to a Google Cloud managed service with minimal effort. What should you do?
Answer options
- A. Create a Google Kubernetes Engine (GKE) standard cluster and deploy Airflow as a workload. Migrate all DAGs to the new Airflow environment.
- B. Create a Cloud Data Fusion instance. For each DAG, create a Cloud Data Fusion pipeline.
- C. Create a new Cloud Composer environment and copy DAGs to the Cloud Composer dags/ folder.
- D. Convert each DAG to a Cloud Workflow and automate the execution with Cloud Scheduler.
Correct answer: C
Explanation
The correct answer is C because Cloud Composer is specifically designed for managing Apache Airflow workflows on Google Cloud, making it the simplest option to migrate existing DAGs. Options A and B involve more complex setups that would require significant reworking of the existing DAGs, while option D would necessitate converting the DAGs into a different format, adding unnecessary complexity to the migration process.