Google Cloud Professional Data Engineer — Question 162
Your organization uses a multi-cloud data storage strategy, storing data in Cloud Storage, and data in Amazon Web Services’ (AWS) S3 storage buckets. All data resides in US regions. You want to query up-to-date data by using BigQuery, regardless of which cloud the data is stored in. You need to allow users to query the tables from BigQuery without giving direct access to the data in the storage buckets. What should you do?
Answer options
- A. Setup a BigQuery Omni connection to the AWS S3 bucket data. Create BigLake tables over the Cloud Storage and S3 data and query the data using BigQuery directly.
- B. Set up a BigQuery Omni connection to the AWS S3 bucket data. Create external tables over the Cloud Storage and S3 data and query the data using BigQuery directly.
- C. Use the Storage Transfer Service to copy data from the AWS S3 buckets to Cloud Storage buckets. Create BigLake tables over the Cloud Storage data and query the data using BigQuery directly.
- D. Use the Storage Transfer Service to copy data from the AWS S3 buckets to Cloud Storage buckets. Create external tables over the Cloud Storage data and query the data using BigQuery directly.
Correct answer: A
Explanation
The correct answer is A because setting up a BigQuery Omni connection allows seamless querying of data across both Cloud Storage and AWS S3 using BigLake tables, which enables data to remain in their original locations while still being accessible. Options B and D suggest creating external tables, which would not allow for the same level of integration and performance as BigLake. Option C involves copying data to Cloud Storage, which is unnecessary and could lead to data latency and additional storage costs.