Google Cloud Professional Cloud DevOps Engineer — Question 69
You support a large service with a well-defined Service Level Objective (SLO). The development team deploys new releases of the service multiple times a week.
If a major incident causes the service to miss its SLO, you want the development team to shift its focus from working on features to improving service reliability.
What should you do before a major incident occurs?
Answer options
- A. Develop an appropriate error budget policy in cooperation with all service stakeholders.
- B. Negotiate with the product team to always prioritize service reliability over releasing new features.
- C. Negotiate with the development team to reduce the release frequency to no more than once a week.
- D. Add a plugin to your Jenkins pipeline that prevents new releases whenever your service is out of SLO.
Correct answer: A
Explanation
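The correct answer is A. An error budget policy states, in advance, what the teams will do when the service exhausts its error budget (the amount of unreliability the SLO allows), typically by pausing feature work in favor of reliability work until the service is back within its SLO. Developing the policy together with all stakeholders before an incident means the shift in focus is already agreed upon when the SLO is missed. Option B always prioritizes reliability, which removes the balance between features and reliability that an SLO is meant to provide, and goes beyond the scenario, which asks for a shift only after a missed SLO. Option C lowers release frequency at all times but defines no response to a missed SLO. Option D blocks releases mechanically without stakeholder agreement, does not redirect the team's work toward reliability, and could also block the releases that carry reliability fixes.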
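
To make the error budget idea concrete, here is a minimal sketch of the arithmetic behind such a policy, assuming a request-based availability SLO. The names `SloWindow`, `error_budget_remaining`, and `team_focus` are hypothetical and chosen for illustration only. A real policy is a written agreement between stakeholders, not a function.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SloWindow:
    """Request counts for one SLO compliance window (hypothetical structure)."""
    slo_target: float      # e.g. 0.999 means 99.9% of requests must succeed
    total_requests: int
    failed_requests: int


def error_budget_remaining(window: SloWindow) -> float:
    """Return the fraction of the error budget still unspent.

    1.0 means no budget used; 0.0 or below means the budget is exhausted.
    """
    # The error budget is the number of failures the SLO tolerates.
    allowed_failures = (1.0 - window.slo_target) * window.total_requests
    if allowed_failures == 0:
        # A 100% target (or an empty window) leaves no budget to spend.
        return 0.0 if window.failed_requests == 0 else float("-inf")
    return 1.0 - window.failed_requests / allowed_failures


def team_focus(window: SloWindow) -> str:
    """Assumed policy rule: exhausted budget shifts work to reliability."""
    return "features" if error_budget_remaining(window) > 0 else "reliability"


healthy = SloWindow(slo_target=0.999, total_requests=1_000_000, failed_requests=500)
after_incident = SloWindow(slo_target=0.999, total_requests=1_000_000, failed_requests=1_500)

print(team_focus(healthy))         # about half the budget is left
print(team_focus(after_incident))  # budget overspent after a major incident
```

With a 99.9% target over one million requests, the budget is roughly 1,000 failed requests. The first window has spent about half of it, so feature work continues. The second has overspent it, so under the assumed rule the team switches to reliability work.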