Implementing Data Engineering Solutions Using Microsoft Fabric — Question 14
You have a Fabric workspace that contains a lakehouse named Lakehouse1.
In an external data source, you have data files that are 500 GB each. A new file is added every day.
You need to ingest the data into Lakehouse1 without applying any transformations. The solution must meet the following requirements
Trigger the process when a new file is added.
Provide the highest throughput.
Which type of item should you use to ingest the data?
Answer options
- A. Eventstream
- B. Dataflow Gen2
- C. Streaming dataset
- D. Data pipeline
Correct answer: D
Explanation
The correct answer is D, Data pipeline, as it is designed to efficiently ingest data from external sources based on triggers, ensuring high throughput. The other options, while useful for other tasks, do not specifically meet the requirements for ingestion with the necessary triggering and throughput capabilities.