Google Cloud Professional Data Engineer — Question 163
You are using a Dataflow streaming job to read messages from a message bus that does not support exactly-once delivery. Your job then applies some transformations, and loads the result into BigQuery. You want to ensure that your data is being streamed into BigQuery with exactly-once delivery semantics. You expect your ingestion throughput into BigQuery to be about 1.5 GB per second. What should you do?
Answer options
- A. Use the BigQuery Storage Write API and ensure that your target BigQuery table is regional.
- B. Use the BigQuery Storage Write API and ensure that your target BigQuery table is multiregional.
- C. Use the BigQuery Streaming API and ensure that your target BigQuery table is regional.
- D. Use the BigQuery Streaming API and ensure that your target BigQuery table is multiregional.
Correct answer: B
Explanation
The correct answer is B, as using the BigQuery Storage Write API with a multiregional table allows for higher throughput and ensures exactly-once delivery semantics. Option A is incorrect because a regional table does not provide the same level of performance for large ingestion rates. Options C and D are also incorrect because the Streaming API does not guarantee exactly-once delivery, which is essential for this scenario.