Google Cloud Professional Cloud DevOps Engineer — Question 73

You support a service that recently had an outage. The outage was caused by a new release that exhausted the service memory resources. You rolled back the release successfully to mitigate the impact on users. You are now in charge of the post-mortem for the outage. You want to follow Site Reliability Engineering practices when developing the post-mortem. What should you do?

Answer options

Correct answer: B

Explanation

Option B is correct because it emphasizes understanding the root causes of the incident, which aligns with SRE practices focused on improving systems rather than blaming individuals. The other options either divert attention from long-term improvements (A), focus on blame (C), or suggest punitive measures (D), which do not foster a constructive post-mortem process.