Google Cloud Associate Data Practitioner — Question 3

Your company is building a near real-time streaming pipeline to process JSON telemetry data from small appliances. You need to process messages arriving at a Pub/Sub topic, capitalize letters in the serial number field, and write results to BigQuery. You want to use a managed service and write a minimal amount of code for underlying transformations. What should you do?

Answer options

Correct answer: C

Explanation

The correct answer is C. The 'Pub/Sub to BigQuery' Dataflow template is a managed streaming pipeline that reads from Pub/Sub and writes to BigQuery, so a short user-defined function (UDF) is the only code needed to capitalize the serial number field. Option A relies on a scheduled query, which runs in batches on a schedule and so does not meet the near real-time requirement. Option B routes the data through Cloud Storage and a Cloud Run service, which adds components and custom code to write and maintain. Option D adds a push subscription and a Cloud Run service, which likewise means building and operating custom code instead of using a managed template.
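
To show how little code the transformation needs, here is a minimal sketch of the UDF logic. It rests on several assumptions: the template's UDF is normally written in JavaScript, so this Python version only mirrors the same shape (one JSON string in, one JSON string out); the function name `process` and the field name `serial_number` are hypothetical, since the question does not give the message schema.

```python
import json


def process(message: str) -> str:
    """Uppercase the serial number in one JSON telemetry message.

    Assumption: the payload is a JSON object with a string field named
    "serial_number" (a hypothetical name, not given in the question).
    """
    record = json.loads(message)
    serial = record.get("serial_number")
    # Only transform string values; leave missing or non-string fields as-is.
    if isinstance(serial, str):
        record["serial_number"] = serial.upper()
    return json.dumps(record)


if __name__ == "__main__":
    sample = '{"serial_number": "ab-12c", "temp_c": 21.5}'
    print(process(sample))
```

Everything else (subscribing to the topic, scaling workers, streaming rows into the table) is handled by the template, which is why option C satisfies the "minimal amount of code" requirement.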