AWS Certified Machine Learning – Specialty — Question 146

A data scientist has a dataset of machine part images stored in Amazon Elastic File System (Amazon EFS). The data scientist needs to use Amazon SageMaker to create and train an image classification machine learning model based on this dataset. Because of budget and time constraints, management wants the data scientist to create and train a model with the least number of steps and integration work required.
How should the data scientist meet these requirements?

Answer options

Correct answer: D

Explanation

The correct answer is D because it allows the data scientist to directly use the EFS file system as the data source for the SageMaker training job, minimizing steps and integration work. Options A, B, and C involve transferring data to other storage services like Amazon FSx for Lustre or Amazon S3, which adds unnecessary complexity and time to the process.