Databricks Certified Generative AI Engineer Associate — Question 1

A Generative Al Engineer has created a RAG application to look up answers to questions about a series of fantasy novels that are being asked on the author’s web forum. The fantasy novel texts are chunked and embedded into a vector store with metadata (page number, chapter number, book title), retrieved with the user’s query, and provided to an LLM for response generation. The Generative AI Engineer used their intuition to pick the chunking strategy and associated configurations but now wants to more methodically choose the best values.
Which TWO strategies should the Generative AI Engineer take to optimize their chunking strategy and parameters? (Choose two.)

Answer options

Correct answer: C, E

Explanation

Option C is correct because it focuses on evaluating different chunking strategies using performance metrics, which is essential for optimization. Option E is also valid as it introduces a metric based on LLM performance for evaluating chunk relevance. Options A and B do not directly address the optimization of chunking parameters, while Option D suggests a method that may not accurately reflect chunk performance related to retrieval.