Google Cloud Professional Data Engineer — Question 102
You work for a global shipping company. You want to train a model on 40 TB of data to predict which ships in each geographic region are likely to cause delivery delays on any given day. The model will be based on multiple attributes collected from multiple sources. Telemetry data, including location in GeoJSON format, will be pulled from each ship and loaded every hour. You want to have a dashboard that shows how many and which ships are likely to cause delays within a region. You want to use a storage solution that has native functionality for prediction and geospatial processing. Which storage solution should you use?
Answer options
- A. BigQuery
- B. Cloud Bigtable
- C. Cloud Datastore
- D. Cloud SQL for PostgreSQL
Correct answer: A
Explanation
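BigQuery is the correct choice because it is the only option that has both capabilities the question asks for built in. BigQuery ML trains and applies models with SQL statements (CREATE MODEL, ML.PREDICT) directly against data stored in BigQuery, and the GEOGRAPHY data type, together with functions such as ST_GEOGFROMGEOJSON, lets you parse the GeoJSON telemetry and test ship locations against region boundaries. As a serverless data warehouse, it is also designed for analytical queries over tens of terabytes, it can absorb the hourly loads, and its query results can feed a dashboard. Cloud Bigtable and Cloud Datastore are NoSQL stores built for low-latency key-based access; neither offers geospatial query functions or built-in model training. Cloud SQL for PostgreSQL can process geospatial data through the PostGIS extension, but it has no native prediction functionality and is a transactional database rather than an analytics engine for data at this scale.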
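To make the "native prediction and geospatial processing" point concrete, here is a minimal sketch in Python that only builds the two BigQuery SQL statements such a pipeline might use. Every dataset, table, and column name (shipping, ship_history, telemetry_latest, regions, caused_delay, and so on) is a hypothetical placeholder, not something the question specifies, and the choice of logistic regression is an assumption for illustration. The SQL itself relies on documented BigQuery features: CREATE MODEL, ML.PREDICT, ST_GEOGFROMGEOJSON, and ST_CONTAINS.

```python
# Sketch only: builds BigQuery SQL strings; it does not connect to BigQuery.
# All dataset, table, and column names are hypothetical placeholders.

DATASET = "shipping"  # assumed dataset name


def create_model_sql(dataset: str = DATASET) -> str:
    """Return a BigQuery ML statement that trains a delay classifier.

    Assumes a history table with a BOOL label column `caused_delay`.
    """
    return f"""
CREATE OR REPLACE MODEL `{dataset}.delay_model`
OPTIONS (model_type = 'LOGISTIC_REG', input_label_cols = ['caused_delay']) AS
SELECT region_name, speed_knots, cargo_tons, caused_delay
FROM `{dataset}.ship_history`
""".strip()


def predict_by_region_sql(dataset: str = DATASET) -> str:
    """Return a query that counts and lists at-risk ships per region.

    The inner query parses each ship's GeoJSON location into a GEOGRAPHY
    value and assigns it to the region whose boundary contains it; the
    outer query applies the model and aggregates for a dashboard.
    """
    return f"""
SELECT region_name,
       COUNT(*) AS ships_at_risk,
       ARRAY_AGG(ship_id) AS ship_ids
FROM ML.PREDICT(
  MODEL `{dataset}.delay_model`,
  (
    SELECT t.ship_id, r.region_name, t.speed_knots, t.cargo_tons
    FROM `{dataset}.telemetry_latest` AS t
    JOIN `{dataset}.regions` AS r
      ON ST_CONTAINS(r.boundary, ST_GEOGFROMGEOJSON(t.location_geojson))
  )
)
WHERE predicted_caused_delay
GROUP BY region_name
""".strip()


if __name__ == "__main__":
    print(create_model_sql())
    print()
    print(predict_by_region_sql())
```

The strings could be submitted through the BigQuery console or any BigQuery client. The point of the sketch is that both training and the spatial join happen inside the warehouse, with no separate ML service or GIS database involved.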