AWS Certified Data Analytics – Specialty — Question 138

A company has 1 million scanned documents stored as image files in Amazon S3. The documents contain typewritten application forms with information including the applicant first name, applicant last name, application date, application type, and application text. The company has developed a machine learning algorithm to extract the metadata values from the scanned documents. The company wants to allow internal data analysts to analyze and find applications using the applicant name, application date, or application text. The original images should also be downloadable. Cost control is secondary to query performance.
Which solution organizes the images and metadata to drive insights while meeting the requirements?

Answer options

Correct answer: B

Explanation

Option B is correct because it uses Amazon OpenSearch Service (formerly Amazon Elasticsearch Service) to index the extracted metadata. The index gives the data analysts fast lookups by applicant name and application date, plus full-text search over the application text. The original images stay in Amazon S3, where they remain downloadable. The other options may be functional, but they do not match OpenSearch Service for query performance or query flexibility, and the scenario ranks query performance above cost control.
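
To make the indexing idea concrete, here is a minimal sketch of how one metadata record and one analyst query might look. It only builds the JSON bodies and does not connect to a cluster. The field names (first_name, s3_uri, and so on), the helper functions, and the bucket path are all assumptions made for illustration; the question does not specify them. The query uses the standard bool, multi_match, match, and range clauses of the OpenSearch query DSL.

```python
import json

# Hypothetical index mapping. Field names are illustrative assumptions,
# not taken from the exam question or from any real deployment.
INDEX_MAPPING = {
    "mappings": {
        "properties": {
            "first_name": {"type": "text"},
            "last_name": {"type": "text"},
            "application_date": {"type": "date"},
            "application_type": {"type": "keyword"},
            "application_text": {"type": "text"},
            # Pointer back to the original image so it can be downloaded from S3.
            "s3_uri": {"type": "keyword"},
        }
    }
}


def build_document(first_name, last_name, application_date,
                   application_type, application_text, s3_uri):
    """Return one index document: extracted metadata plus the image location."""
    return {
        "first_name": first_name,
        "last_name": last_name,
        "application_date": application_date,  # ISO 8601 date string
        "application_type": application_type,
        "application_text": application_text,
        "s3_uri": s3_uri,
    }


def build_search_body(name=None, text=None, date_from=None, date_to=None):
    """Return a query DSL body combining name, text, and date criteria."""
    must = []
    filters = []
    if name:
        # Match the name against both first and last name fields.
        must.append({"multi_match": {"query": name,
                                     "fields": ["first_name", "last_name"]}})
    if text:
        # Full-text match on the application body.
        must.append({"match": {"application_text": text}})
    if date_from or date_to:
        bounds = {}
        if date_from:
            bounds["gte"] = date_from
        if date_to:
            bounds["lte"] = date_to
        filters.append({"range": {"application_date": bounds}})
    if not must and not filters:
        return {"query": {"match_all": {}}}
    return {"query": {"bool": {"must": must, "filter": filters}}}


if __name__ == "__main__":
    doc = build_document("Ana", "Silva", "2021-03-15", "renewal",
                         "Request to renew the permit.",
                         "s3://example-bucket/scans/000001.png")  # made-up path
    body = build_search_body(name="Silva", date_from="2021-01-01")
    print(json.dumps(doc, indent=2))
    print(json.dumps(body, indent=2))
```

An analyst tool would send the search body to the index and read s3_uri from each hit to fetch the original image. Keeping only the pointer in the index, rather than the image itself, keeps the index small while the images remain in S3.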