AWS Certified Generative AI – Professional (AIP-C01) — Question 7
A financial services company is building a customer support application that retrieves relevant financial regulation documents from a database based on semantic similarities to user queries. The application must integrate with Amazon Bedrock to generate responses. The application must be able to search documents that are in English, Spanish, and Portuguese. The application must filter documents by metadata such as publication date, regulatory agency, and document type.
The database stores approximately 10 million document embeddings. To minimize operational overhead, the company wants a solution that minimizes management and maintenance effort. The application must provide low-latency responses for real-time customer interactions.
Which solution will meet these requirements?
Answer options
- A. Use Amazon OpenSearch Serverless to provide vector search capabilities and metadata filtering. Connect to Amazon Bedrock Knowledge Bases to enable Retrieval Augmented Generation (RAG) capabilities that use an Anthropic Claude foundation model (FM).
- B. Deploy an Amazon Aurora PostgreSQL database with the pgvector extension. Define tables to store embeddings and metadata. Use SQL queries to perform similarity searches. Send retrieved documents to Amazon Bedrock to generate responses.
- C. Use Amazon S3 Vectors to configure a vector index and non-filterable metadata fields. Integrate S3 Vectors with Amazon Bedrock to enable Retrieval Augmented Generation (RAG) capabilities.
- D. Set up an Amazon Neptune Analytics graph database. Configure a vector index that has appropriate dimensionality to store document embeddings. Use Amazon Bedrock to perform graph-based retrieval and to generate responses.
Correct answer: A
Explanation
Option A is correct as it utilizes Amazon OpenSearch Serverless for efficient vector search and metadata filtering, while integrating seamlessly with Amazon Bedrock for RAG capabilities. Option B requires more management with an Aurora database and SQL queries, which contradicts the requirement to minimize operational overhead. Option C is not appropriate as S3 Vectors does not provide the necessary capabilities for metadata filtering, and Option D, while capable of graph-based retrieval, adds unnecessary complexity and management compared to the serverless approach of Option A.