Data Engineering on Microsoft Azure — Question 65
You have two Azure Blob Storage accounts named account1 and account2.
You plan to create an Azure Data Factory pipeline that will use scheduled intervals to replicate newly created or modified blobs from account1 to account2.
You need to recommend a solution to implement the pipeline. The solution must meet the following requirements:
• Ensure that the pipeline only copies blobs that were created or modified since the most recent replication event.
• Minimize the effort to create the pipeline.
What should you recommend?
Answer options
- A. Run the Copy Data tool and select Metadata-driven copy task.
- B. Create a pipeline that contains a Data Flow activity.
- C. Create a pipeline that contains a flowlet.
- D. Run the Copy Data tool and select Built-in copy task.
Correct answer: D
Explanation
The correct answer is D because the Built-in copy task in the Copy Data tool is specifically designed for efficient data replication and can easily handle the requirement of copying only newly created or modified blobs. Options A and C are not as effective for this specific scenario, and option B adds unnecessary complexity by involving a Data Flow activity, which is not required for the task at hand.