Google Cloud Professional Data Engineer — Question 304
You're using Bigtable for a real-time application with a heavy load that mixes reads and writes. You've recently identified an additional use case: an hourly analytical job that calculates certain statistics across the whole database. You need to ensure the reliability of both your production application and the analytical workload.
What should you do?
Answer options
- A. Export a Bigtable dump to GCS and run your analytical job on the exported files.
- B. Add a second cluster to the existing instance with multi-cluster routing; use a live-traffic app profile for your regular workload and a batch-analytics app profile for the analytics workload.
- C. Add a second cluster to the existing instance with single-cluster routing; use a live-traffic app profile for your regular workload and a batch-analytics app profile for the analytics workload.
- D. Double the size of your existing cluster and run your analytics workload on the resized cluster.
Correct answer: C
Explanation
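Option C is correct. With two clusters in one instance, Bigtable replicates data between them, and an app profile with single-cluster routing sends all of its requests to one named cluster. Pointing the live-traffic profile at one cluster and the batch-analytics profile at the other keeps the hourly full-database scan off the nodes that serve production traffic, so the two workloads do not compete for the same resources. Option A is wrong because the job would run on exported files rather than live data, so its results go stale between exports, and exporting the whole database every hour adds its own overhead. Option B is wrong because multi-cluster routing sends each request to the nearest available cluster and fails over automatically; that improves availability, but it cannot pin the analytical workload to a dedicated cluster. Option D is wrong because a larger cluster adds capacity while both workloads still share the same nodes, so the analytical scan can still degrade production latency.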
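
The setup in option C can be sketched with the google-cloud-bigtable Python client. This is a minimal illustration, not a definitive implementation: it assumes the second cluster has already been added to the instance, and the cluster IDs (prod-cluster, analytics-cluster), the ProfilePlan helper, and the function names are hypothetical. Only the app profile names come from the question.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProfilePlan:
    """Hypothetical helper: one app profile pinned to one cluster."""
    app_profile_id: str
    cluster_id: str
    description: str


# Cluster IDs are assumptions for illustration; use your own.
PLANS = [
    ProfilePlan("live-traffic", "prod-cluster", "Serving reads and writes"),
    ProfilePlan("batch-analytics", "analytics-cluster", "Hourly statistics job"),
]


def create_profiles(project_id, instance_id, plans=PLANS):
    """Create one single-cluster-routing app profile per plan."""
    # Imported here so the plan above can be inspected without the library.
    from google.cloud import bigtable
    from google.cloud.bigtable import enums

    client = bigtable.Client(project=project_id, admin=True)
    instance = client.instance(instance_id)
    for plan in plans:
        profile = instance.app_profile(
            plan.app_profile_id,
            # SINGLE routes every request from this profile to cluster_id.
            routing_policy_type=enums.RoutingPolicyType.SINGLE,
            description=plan.description,
            cluster_id=plan.cluster_id,
        )
        profile.create(ignore_warnings=True)


def open_table(project_id, instance_id, table_id, app_profile_id):
    """Return a table handle whose requests use the given app profile."""
    from google.cloud import bigtable

    client = bigtable.Client(project=project_id)
    return client.instance(instance_id).table(
        table_id, app_profile_id=app_profile_id
    )
```

Under these assumptions, the production service would call open_table(..., app_profile_id="live-traffic") and the hourly job would call open_table(..., app_profile_id="batch-analytics"), so each workload reaches only its own cluster while replication keeps the data in sync.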