A Generative AI Engineer just deployed an LLM application at a digital marketing company…

Question

A Generative AI Engineer just deployed an LLM application at a digital marketing company that assists with answering customer service inquiries.
Which metric should they monitor for their customer service LLM application in production?

Accepted Answer

Correct answer: A. A. Number of customer inquiries processed per unit of time — The correct answer is A because monitoring the number of customer inquiries processed per unit of time provides insight into the application's efficiency and responsiveness in real-time customer service scenarios. Options B, C, and D are less relevant for assessing the application's operational performance; energy usage per query does not directly indicate effectiveness, perplexity scores pertain to training rather than live performance, and leaderboard values are not applicable to a specific production environment.

Databricks Certified Generative AI Engineer Associate — Question 11

Answer options

Correct answer: A

Explanation