A company uses Amazon Bedrock to implement a Retrieval Augmented Generation (RAG)-based s…

Question

A company uses Amazon Bedrock to implement a Retrieval Augmented Generation (RAG)-based system to serve medical information to users. The company needs to compare multiple chunking strategies, evaluate the generation quality of two foundation models (FMs), and enforce quality thresholds for deployment.
Which Amazon Bedrock evaluation configuration will meet these requirements?

Accepted Answer

Correct answer: B. Create a retrieve-and-generate evaluation job that uses custom precision at k metrics and an LLM-as-a-judge metric that uses a scale of 1-5. Include each chunking strategy in the evaluation dataset. Use a supported version of Anthropic Claude Sonnet to evaluate responses from both FMs.

Option B is correct because a retrieve-and-generate evaluation job covers both retrieval and generation, and the requirements need both. Precision at k measures retrieval quality, which is what changes from one chunking strategy to the next. The LLM-as-a-judge metric scores the generated responses of both FMs on the same 1-5 scale, so the two models can be compared directly and the scores checked against a quality threshold before deployment. The other options either focus solely on retrieval or leave out part of this combination.
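To make the two metric types in option B concrete, here is a minimal Python sketch of precision at k and a simple deployment gate. It is an illustration only and does not call Amazon Bedrock. The function names, the chunk IDs, and the threshold values (0.6 and 4.0) are assumptions made up for this example and do not come from the question.

```python
from statistics import mean


def precision_at_k(retrieved_ids, relevant_ids, k):
    """Return the fraction of the top-k retrieved chunks that are relevant."""
    if k <= 0:
        raise ValueError("k must be positive")
    relevant = set(relevant_ids)
    top_k = retrieved_ids[:k]
    hits = sum(1 for chunk_id in top_k if chunk_id in relevant)
    # Standard definition: divide by k, even if fewer than k chunks came back.
    return hits / k


def passes_quality_gate(precisions, judge_scores,
                        min_precision=0.6, min_judge=4.0):
    """Hypothetical gate: both averages must meet their thresholds.

    judge_scores are assumed to be LLM-as-a-judge ratings on a 1-5 scale.
    The default thresholds are illustrative, not recommended values.
    """
    return (mean(precisions) >= min_precision
            and mean(judge_scores) >= min_judge)


if __name__ == "__main__":
    # Two of the top three retrieved chunks are relevant.
    p = precision_at_k(["c1", "c7", "c3", "c9"], ["c1", "c3"], k=3)
    print(round(p, 3))
    print(passes_quality_gate([p, 0.8], [4, 5]))
```

Running the same gate once per chunking strategy and once per FM gives a like-for-like comparison, which is the reason for putting every chunking strategy in one evaluation dataset.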

AWS Certified Generative AI – Professional (AIP-C01) — Question 50
