Google Cloud Professional Data Engineer — Question 297
You are migrating your data warehouse to Google Cloud and decommissioning your on-premises data center. Because this is a priority for your company, you know that bandwidth will be made available for the initial data load to the cloud. The files being transferred are not large in number, but each file is 90 GB.
Additionally, you want your transactional systems to continually update the warehouse on Google Cloud in real time. What tools should you use to migrate the data and ensure that it continues to write to your warehouse?
Answer options
- A. Storage Transfer Service for the migration; Pub/Sub and Cloud Data Fusion for the real-time updates
- B. BigQuery Data Transfer Service for the migration; Pub/Sub and Dataproc for the real-time updates
- C. gsutil for the migration; Pub/Sub and Dataflow for the real-time updates
- D. gsutil for both the migration and the real-time updates
Correct answer: C
Explanation
The correct answer is C because gsutil is an effective tool for transferring large files, making it suitable for the initial migration of 90 GB files. Additionally, using Pub/Sub alongside Dataflow allows for real-time updates to the data warehouse. The other options either suggest inappropriate tools for the migration or do not adequately facilitate real-time updates.