Google Cloud Professional Cloud DevOps Engineer — Question 3
Your organization wants to implement Site Reliability Engineering (SRE) culture and principles. Recently, a service that you support had a limited outage. A manager on another team asks you to provide a formal explanation of what happened so they can action remediations. What should you do?
Answer options
- A. Develop a postmortem that includes the root causes, resolution, lessons learned, and a prioritized list of action items. Share it with the manager only.
- B. Develop a postmortem that includes the root causes, resolution, lessons learned, and a prioritized list of action items. Share it on the engineering organization's document portal.
- C. Develop a postmortem that includes the root causes, resolution, lessons learned, the list of people responsible, and a list of action items for each person. Share it with the manager only.
- D. Develop a postmortem that includes the root causes, resolution, lessons learned, the list of people responsible, and a list of action items for each person. Share it on the engineering organization's document portal.
Correct answer: B
Explanation
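The correct answer is B. SRE postmortems are blameless: they document the root causes, resolution, lessons learned, and a prioritized list of action items without singling out individuals. Sharing the postmortem on the engineering organization's document portal promotes transparency and lets other teams learn from the incident, which supports a culture of continuous improvement. Option A has the right content but restricts it to one manager, so the rest of the organization cannot benefit from it. Options C and D list the people responsible and assign action items per person, which shifts the focus from fixing systems and processes to assigning blame. That conflicts with the blameless principle and discourages honest reporting of future incidents.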