Google Cloud Professional Cloud Architect — Question 135
Your team is developing a web application that will be deployed on Google Kubernetes Engine (GKE). Your CTO expects a successful launch and you need to ensure your application can handle the expected load of tens of thousands of users. You want to test the current deployment to ensure the latency of your application stays below a certain threshold. What should you do?
Answer options
- A. Use a load testing tool to simulate the expected number of concurrent users and total requests to your application, and inspect the results.
- B. Enable autoscaling on the GKE cluster and enable horizontal pod autoscaling on your application deployments. Send curl requests to your application, and validate if the auto scaling works.
- C. Replicate the application over multiple GKE clusters in every Google Cloud region. Configure a global HTTP(S) load balancer to expose the different clusters over a single global IP address.
- D. Use Cloud Debugger in the development environment to understand the latency between the different microservices.
Correct answer: A
Explanation
The correct answer is A because using a load testing tool helps simulate real-world traffic and provides direct insights into how the application performs under load, allowing you to measure latency effectively. Options B and C focus on scaling and distribution, which do not directly test latency under load. Option D involves debugging but does not provide a comprehensive test of application performance under expected user loads.