AWS Certified Machine Learning – Specialty — Question 228
An ecommerce company is collecting structured data and unstructured data from its website, mobile apps, and IoT devices. The data is stored in several databases and Amazon S3 buckets. The company is implementing a scalable repository to store structured data and unstructured data. The company must implement a solution that provides a central data catalog, self-service access to the data, and granular data access policies and encryption to protect the data.
Which combination of actions will meet these requirements with the LEAST amount of setup? (Choose three.)
Answer options
- A. Identify the existing data in the databases and S3 buckets. Link the data to AWS Lake Formation.
- B. Identify the existing data in the databases and S3 buckets. Link the data to AWS Glue.
- C. Run AWS Glue crawlers on the linked data sources to create a central data catalog.
- D. Apply granular access policies by using AWS Identity and Access Management (IAM). Configure server-side encryption on each data source.
- E. Apply granular access policies and encryption by using AWS Lake Formation.
- F. Apply granular access policies and encryption by using AWS Glue.
Correct answer: A, C, E
Explanation
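The correct combination is A, C, and E. Linking the existing databases and S3 buckets to AWS Lake Formation (A) brings them under one managed data lake. Running AWS Glue crawlers on the linked sources (C) populates the central data catalog. Lake Formation then applies granular access policies and encryption from a single place (E), which also supports self-service access to the cataloged data. Option B links the data to AWS Glue alone, which does not provide the same level of centralized management as Lake Formation. Option D can protect the data, but writing IAM policies and configuring server-side encryption separately on each data source takes more setup than managing permissions centrally. Option F is incorrect because AWS Glue is primarily a catalog and ETL service; fine-grained permissions, such as table-level and column-level grants, are a Lake Formation feature.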
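The three chosen actions can be sketched with boto3 as follows. This is a minimal illustration, not a reference setup: the helper names (`register_resource_kwargs`, `create_crawler_kwargs`, `grant_select_kwargs`, `run_setup`) are hypothetical, and the ARNs, bucket, database, and role names passed in are placeholders supplied by the caller. The helpers only build the request arguments, so they can be inspected without AWS access; `run_setup` makes the real calls and assumes credentials and the IAM role already exist.

```python
from typing import Any, Dict, List


def register_resource_kwargs(s3_arn: str) -> Dict[str, Any]:
    """Arguments for lakeformation.register_resource (action A)."""
    # UseServiceLinkedRole lets Lake Formation use its service-linked role
    # to access the location; a custom RoleArn could be passed instead.
    return {"ResourceArn": s3_arn, "UseServiceLinkedRole": True}


def create_crawler_kwargs(
    name: str, role_arn: str, database: str, s3_path: str
) -> Dict[str, Any]:
    """Arguments for glue.create_crawler (action C)."""
    return {
        "Name": name,
        "Role": role_arn,
        "DatabaseName": database,  # catalog database the crawler writes to
        "Targets": {"S3Targets": [{"Path": s3_path}]},
    }


def grant_select_kwargs(
    principal_arn: str, database: str, table: str, columns: List[str]
) -> Dict[str, Any]:
    """Arguments for lakeformation.grant_permissions (action E)."""
    return {
        "Principal": {"DataLakePrincipalIdentifier": principal_arn},
        "Resource": {
            "TableWithColumns": {
                "DatabaseName": database,
                "Name": table,
                "ColumnNames": columns,  # column-level (granular) access
            }
        },
        "Permissions": ["SELECT"],
    }


def run_setup(s3_arn: str, crawler: Dict[str, Any]) -> None:
    """Register the location, then create and start the crawler."""
    import boto3  # imported lazily so the builders work without boto3

    lakeformation = boto3.client("lakeformation")
    glue = boto3.client("glue")
    lakeformation.register_resource(**register_resource_kwargs(s3_arn))
    glue.create_crawler(**crawler)
    glue.start_crawler(Name=crawler["Name"])
    # The crawler runs asynchronously. Call grant_permissions with
    # grant_select_kwargs(...) only after it has created the table.
```

Separating argument construction from the API calls is a design choice for this sketch only; it makes the shape of each request visible and keeps the grant step apart from the crawler run, which must finish before the table exists.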