AWS Certified Machine Learning – Specialty — Question 87

A trucking company is collecting live image data from its fleet of trucks across the globe. The data is growing rapidly and approximately 100 GB of new data is generated every day. The company wants to explore machine learning uses cases while ensuring the data is only accessible to specific IAM users.
Which storage option provides the most processing flexibility and will allow access control with IAM?

Answer options

Correct answer: B

Explanation

The correct answer is B because an Amazon S3-backed data lake provides flexibility in processing and scalability for large amounts of data while allowing fine-grained access control through bucket policies. Option A is incorrect because DynamoDB is not ideal for storing large binary images. Option C is not suitable as HDFS is more complex and less flexible for the stated use case. Option D does not provide the same level of data lake capabilities that S3 offers for machine learning applications.