Question

A Generative AI Engineer is deciding between using LSH (Locality Sensitive Hashing) and HNSW (Hierarchical Navigable Small World) for indexing their vector database. Their top priority is semantic accuracy. Which approach should the Generative AI Engineer use to evaluate these two techniques?

Accepted Answer

Correct answer: A. Compare the cosine similarities of the embeddings of returned results against those of a representative sample of test inputs.

Cosine similarity measures how close the embeddings of the returned results are to the embeddings of the test inputs, so it directly reflects semantic accuracy: the index whose results score higher on a representative query sample is retrieving semantically closer matches. Options B and C rely on evaluation metrics suited to text generation tasks rather than to similarity in vector space. Option D, Levenshtein distance, measures character-level edit distance between strings, which says nothing about meaning and is therefore not appropriate here.
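The evaluation described in option A could be sketched roughly as follows. This is a minimal illustration, not a benchmark of real LSH or HNSW indexes: the corpus and queries are random vectors, and the two "indexes" are hypothetical stand-ins (an exact brute-force search and a random pick) chosen only to show how the mean cosine similarity separates a higher-quality retriever from a lower-quality one. The function names are assumptions made for this example.

```python
import numpy as np


def cosine_similarity(a, b):
    """Cosine similarity along the last axis, broadcasting over leading axes."""
    a_unit = a / np.linalg.norm(a, axis=-1, keepdims=True)
    b_unit = b / np.linalg.norm(b, axis=-1, keepdims=True)
    return np.sum(a_unit * b_unit, axis=-1)


def mean_result_similarity(queries, results):
    """Mean cosine similarity between each query and its returned results.

    queries: array of shape (n_queries, dim)
    results: array of shape (n_queries, k, dim), the embeddings an index returned
    """
    sims = cosine_similarity(queries[:, None, :], results)  # shape (n_queries, k)
    return float(sims.mean())


# Toy data (assumption): random embeddings stand in for a real corpus and test set.
rng = np.random.default_rng(0)
corpus = rng.normal(size=(500, 32))
queries = rng.normal(size=(20, 32))
k = 5

# Stand-in "index A": exact top-k by cosine similarity (brute force).
corpus_unit = corpus / np.linalg.norm(corpus, axis=1, keepdims=True)
queries_unit = queries / np.linalg.norm(queries, axis=1, keepdims=True)
scores = queries_unit @ corpus_unit.T                # shape (n_queries, n_corpus)
exact_ids = np.argsort(-scores, axis=1)[:, :k]

# Stand-in "index B": random ids, mimicking an index with very poor recall.
random_ids = rng.integers(0, len(corpus), size=(len(queries), k))

score_a = mean_result_similarity(queries, corpus[exact_ids])
score_b = mean_result_similarity(queries, corpus[random_ids])
print(f"index A mean cosine similarity: {score_a:.3f}")
print(f"index B mean cosine similarity: {score_b:.3f}")
```

In a real comparison, `corpus[exact_ids]` and `corpus[random_ids]` would be replaced by the embeddings each candidate index actually returns for the same representative test queries, and the index with the higher mean similarity would be preferred when semantic accuracy is the priority.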

Databricks Certified Generative AI Engineer Associate — Question 61
