Google Cloud Associate Data Practitioner — Question 10
You have a Dataflow pipeline that processes website traffic logs stored in Cloud Storage and writes the processed data to BigQuery. You noticed that the pipeline is failing intermittently. You need to troubleshoot the issue. What should you do?
Answer options
- A. Use Cloud Logging to identify error groups in the pipeline's logs. Use Cloud Monitoring to create a dashboard that tracks the number of errors in each group.
- B. Use Cloud Logging to create a chart displaying the pipeline’s error logs. Use Metrics Explorer to validate the findings from the chart.
- C. Use Cloud Logging to view error messages in the pipeline's logs. Use Cloud Monitoring to analyze the pipeline's metrics, such as CPU utilization and memory usage.
- D. Use the Dataflow job monitoring interface to check the pipeline's status every hour. Use Cloud Profiler to analyze the pipeline’s metrics, such as CPU utilization and memory usage.
Correct answer: C
Explanation
The correct answer, C, is appropriate because it focuses on directly viewing error messages in the logs and analyzing relevant metrics, which is essential for identifying the root cause of the failures. Option A is less effective as it groups errors without examining specific messages, while B focuses on charting without detailed analysis. Option D relies on periodic checks rather than immediate troubleshooting of error logs and metrics.