Google Cloud Professional Cloud DevOps Engineer — Question 40
Your company follows Site Reliability Engineering principles. You are writing a postmortem for an incident, triggered by a software change, that severely affected users. You want to prevent severe incidents from happening in the future. What should you do?
Answer options
- A. Identify engineers responsible for the incident and escalate to their senior management.
- B. Ensure that test cases that catch errors of this type are run successfully before new software releases.
- C. Follow up with the employees who reviewed the changes and prescribe practices they should follow in the future.
- D. Design a policy that will require on-call teams to immediately call engineers and management to discuss a plan of action if an incident occurs.
Correct answer: B
Explanation
The correct answer is B because running comprehensive test cases before software releases helps identify potential issues that could lead to severe incidents, thereby preventing them. Option A focuses on blame rather than prevention, option C addresses post-incident review without improving future processes, and option D centers on immediate response rather than proactive measures to reduce the likelihood of incidents.