Google Cloud Professional Data Engineer — Question 115
You currently have a single on-premises Kafka cluster in a data center in the us-east region that is responsible for ingesting messages from IoT devices globally.
Because large parts of globe have poor internet connectivity, messages sometimes batch at the edge, come in all at once, and cause a spike in load on your
Kafka cluster. This is becoming difficult to manage and prohibitively expensive. What is the Google-recommended cloud native architecture for this scenario?
Answer options
- A. Edge TPUs as sensor devices for storing and transmitting the messages.
- B. Cloud Dataflow connected to the Kafka cluster to scale the processing of incoming messages.
- C. An IoT gateway connected to Cloud Pub/Sub, with Cloud Dataflow to read and process the messages from Cloud Pub/Sub.
- D. A Kafka cluster virtualized on Compute Engine in us-east with Cloud Load Balancing to connect to the devices around the world.
Correct answer: C
Explanation
The correct answer is C because an IoT gateway connected to Cloud Pub/Sub allows for efficient handling of messages from devices with poor connectivity, enabling better scalability and reliability. Options A and B do not provide the necessary architecture to manage message spikes effectively, while option D does not address the core issue of internet connectivity and scalability in a cloud-native manner.