Implementing an Azure Data Solution (legacy) — Question 58
You have an Azure Data Lake Storage Gen2 account. You have a number of CSV files loaded in the account. Each file has a header row. After the header row is a property that is formatted by carriage return (/r) and line feed (/n).
You need to load the files daily as a batch into Azure SQL Data warehouse using Polybase. You have to skip the header row when the files are imported.
Which of the following actions would you take to implement this requirement? (Choose three.)
Answer options
- A. Create an external data source and ensure to use the abfs location
- B. Create an external data source and ensure to use the Hadoop location
- C. Create an external file format and set the First_row option
- D. Create a database scoped credential that uses OAuth2 token and a key
- E. Use the CREATE EXTERNAL TABLE AS SELECT and create a view that removes the empty row
Correct answer: A, C, D
Explanation
The correct actions are A, C, and D. Option A is necessary to create an external data source for accessing the files in Azure Data Lake Storage. Option C is essential to specify that the first row should be skipped during the import process. Option D is required to authenticate using OAuth2, which is needed for accessing the data securely. Options B and E are incorrect, as they do not meet the requirement of skipping the header row or are not suitable actions for this scenario.