AWS Certified Data Analytics – Specialty — Question 97

A large telecommunications company is planning to set up a data catalog and metadata management for multiple data sources running on AWS. The catalog will be used to maintain the metadata of all the objects stored in the data stores. The data stores are composed of structured sources like Amazon RDS and Amazon
Redshift, and semistructured sources like JSON and XML files stored in Amazon S3. The catalog must be updated on a regular basis, be able to detect the changes to object metadata, and require the least possible administration.
Which solution meets these requirements?

Answer options

Correct answer: B

Explanation

The correct answer is B because AWS Glue Data Catalog is specifically designed for managing and updating metadata from various sources with minimal administrative effort. Other options like using Amazon Aurora or DynamoDB would require more management and manual updates, which contradicts the requirement for less administration. Option D, while also utilizing AWS Glue, does not address the requirement for regular updates through crawlers as effectively as option B does.