You work on an operations team at an international company that manages a large fleet of…

Question

You work on an operations team at an international company that manages a large fleet of on-premises servers located in few data centers around the world. Your team collects monitoring data from the servers, including CPU/memory consumption. When an incident occurs on a server, your team is responsible for fixing it. Incident data has not been properly labeled yet. Your management team wants you to build a predictive maintenance solution that uses monitoring data from the VMs to detect potential failures and then alerts the service desk team. What should you do first?

Accepted Answer

Correct answer: B. B. Implement a simple heuristic (e.g., based on z-score) to label the machines’ historical performance data. Train a model to predict anomalies based on this labeled dataset. — The correct answer is B because implementing a heuristic to label historical performance data allows for a structured dataset, which is essential for training an effective anomaly detection model. Options A and C do not address the need for labeled data, and D could be time-consuming and resource-intensive compared to using a heuristic method.

Google Cloud Professional Machine Learning Engineer — Question 109

Answer options

Correct answer: B

Explanation