Designing Azure Infrastructure Solutions — Question 106
You have an Azure virtual machine named VM1 that runs Windows Server 2019 and contains 500 GB of data files.
You are designing a solution that will use Azure Data Factory to transform the data files, and then load the files to Azure Data Lake Storage.
What should you deploy on VM1 to support the design?
Answer options
- A. the On-premises data gateway
- B. the Azure Pipelines agent
- C. the self-hosted integration runtime
- D. the Azure File Sync agent
Correct answer: C
Explanation
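The self-hosted integration runtime is required because the data files sit on VM1's local file system, a private data store that the default Azure integration runtime cannot normally reach directly. Although VM1 is an Azure virtual machine rather than an on-premises server, the requirement is the same as for on-premises sources: installing the self-hosted integration runtime on VM1 lets Azure Data Factory run copy activities against those files and load the results to Azure Data Lake Storage. The On-premises data gateway serves other services, such as Power BI, Power Apps, and Azure Logic Apps, and Data Factory does not use it. The Azure Pipelines agent runs Azure DevOps build and release jobs, and the Azure File Sync agent synchronizes Windows file shares with Azure Files; neither connects Data Factory to the files on VM1.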
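
To illustrate how Data Factory is pointed at the runtime installed on VM1, the following is a minimal sketch that builds a File System linked service definition whose `connectVia` property references a self-hosted integration runtime. The names `VM1Files`, `VM1SelfHostedIR`, `D:\DataFiles`, and `VM1\adfreader` are hypothetical placeholders, not values given in the question, and the helper function exists only for this example.

```python
import json


def file_server_linked_service(name, host, user_id, ir_name):
    """Build a File System linked service definition that is routed
    through a self-hosted integration runtime (all names are illustrative)."""
    return {
        "name": name,
        "properties": {
            "type": "FileServer",
            "typeProperties": {
                "host": host,        # root folder on VM1 (assumed path)
                "userId": user_id,   # account with read access (assumed)
                # Placeholder only; a real deployment would typically
                # reference a secret store instead of an inline value.
                "password": {"type": "SecureString", "value": "<password>"},
            },
            # This reference makes Data Factory use the runtime installed
            # on VM1 rather than the default Azure integration runtime.
            "connectVia": {
                "referenceName": ir_name,
                "type": "IntegrationRuntimeReference",
            },
        },
    }


definition = file_server_linked_service(
    name="VM1Files",
    host="D:\\DataFiles",
    user_id="VM1\\adfreader",
    ir_name="VM1SelfHostedIR",
)
print(json.dumps(definition, indent=2))
```

A dataset and copy activity would then use this linked service as the source, with an Azure Data Lake Storage linked service as the sink.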