Databricks Certified Generative AI Engineer Associate — Question 51

A Generative AI Engineer is building a production-ready LLM system which replies directly to customers. The solution makes use of the Foundation Model API via provisioned throughput. They are concerned that the LLM could potentially respond in a toxic or otherwise unsafe way. They also wish to perform this with the least amount of effort.

Which approach will do this?

Answer options

Correct answer: B

Explanation

The correct answer is B because hosting Llama Guard on the Foundation Model API provides a dedicated solution for detecting unsafe responses with minimal additional effort. Option A relies on user reporting, which can be inconsistent and slow. Option C requires additional LLM calls, increasing complexity, while Option D involves regex, which may not effectively capture all unsafe responses.