Google Cloud Professional Data Engineer — Question 147
You store and analyze your relational data in BigQuery on Google Cloud with all data that resides in US regions. You also have a variety of object stores across Microsoft Azure and Amazon Web Services (AWS), also in US regions. You want to query all your data in BigQuery daily with as little movement of data as possible. What should you do?
Answer options
- A. Use BigQuery Data Transfer Service to load files from Azure and AWS into BigQuery.
- B. Create a Dataflow pipeline to ingest files from Azure and AWS to BigQuery.
- C. Load files from AWS and Azure to Cloud Storage with Cloud Shell gsutil rsync arguments.
- D. Use the BigQuery Omni functionality and BigLake tables to query files in Azure and AWS.
Correct answer: D
Explanation
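The correct answer is D. BigQuery Omni runs the query engine in the AWS or Azure region where the data lives, and BigLake tables expose the files in Amazon S3 and Azure Blob Storage as tables you can query from BigQuery, so the source data stays in place. Options A, B, and C all copy the data into Google Cloud first (A and B load it into BigQuery, C copies it into Cloud Storage), which contradicts the requirement to move as little data as possible.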
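As a rough illustration of option D, the sketch below builds the kind of CREATE EXTERNAL TABLE statement used to define a BigLake table over files in S3 or Azure Blob Storage. It only assembles the SQL text and does not call any Google Cloud API. The helper name biglake_ddl, the project, dataset, table, connection, bucket, and storage account names are all hypothetical placeholders, and the sketch assumes a BigQuery connection and dataset already exist in the matching Omni location.

```python
def biglake_ddl(table, connection, uris, fmt="PARQUET"):
    """Build a CREATE EXTERNAL TABLE statement for a BigLake table.

    Hypothetical helper for illustration only: it returns SQL text and
    does not talk to BigQuery.
    """
    uri_list = ", ".join(f"'{u}'" for u in uris)
    return (
        f"CREATE EXTERNAL TABLE `{table}`\n"
        f"WITH CONNECTION `{connection}`\n"
        f"OPTIONS (format = '{fmt}', uris = [{uri_list}]);"
    )


# All identifiers below are placeholders, not values from the question.
aws_ddl = biglake_ddl(
    table="my_project.aws_dataset.orders",
    connection="aws-us-east-1.s3-read-connection",
    uris=["s3://example-bucket/orders/*"],
)

azure_ddl = biglake_ddl(
    table="my_project.azure_dataset.events",
    connection="azure-eastus2.blob-read-connection",
    uris=["azure://exampleaccount.blob.core.windows.net/events/*"],
)

print(aws_ddl)
print(azure_ddl)
```

Once tables like these exist, a daily query against them reads the files where they are stored, and only the query results need to leave the other cloud.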