AWS Certified Machine Learning – Specialty — Question 97

A Machine Learning Specialist is designing a scalable data storage solution for Amazon SageMaker. There is an existing TensorFlow-based model implemented as a train.py script that relies on static training data that is currently stored as TFRecords.
Which method of providing training data to Amazon SageMaker would meet the business requirements with the LEAST development overhead?

Answer options

Correct answer: B

Explanation

Option B is correct because it lets the existing train.py script run unchanged: the training job is simply pointed at the S3 bucket that holds the TFRecords, which keeps development overhead to a minimum. Option A does not meet the requirement because it does not use S3. Options C and D require significant changes to the script or preprocessing of the data, which adds development effort.
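
To illustrate what "pointing to the S3 bucket" can look like in practice, here is a minimal sketch that assumes the SageMaker Python SDK v2 and its TensorFlow estimator. The bucket name, prefix, role ARN, instance type, and framework/Python versions are placeholder assumptions and are not taken from the question. The helper only builds the configuration; the actual job launch is kept in a separate function so that the sketch can be read and run without AWS credentials.

```python
def build_training_config(bucket, prefix, role_arn):
    """Build estimator arguments and an input channel for a TensorFlow job.

    All concrete values below (instance type, versions) are illustrative
    assumptions; adjust them to match the real environment.
    """
    estimator_kwargs = {
        "entry_point": "train.py",        # the existing script, used as-is
        "role": role_arn,                 # IAM role SageMaker assumes
        "instance_count": 1,
        "instance_type": "ml.m5.xlarge",  # assumed instance type
        "framework_version": "2.11",      # assumed TensorFlow version
        "py_version": "py39",             # assumed Python version
    }
    # One channel named "training" that points at the TFRecords in S3.
    # The data is not converted or reformatted.
    inputs = {"training": f"s3://{bucket}/{prefix.strip('/')}"}
    return estimator_kwargs, inputs


def launch_training(estimator_kwargs, inputs):
    """Start the training job. Requires the sagemaker package and AWS access."""
    from sagemaker.tensorflow import TensorFlow  # imported lazily on purpose

    estimator = TensorFlow(**estimator_kwargs)
    estimator.fit(inputs)  # SageMaker makes the S3 data available to train.py
    return estimator


if __name__ == "__main__":
    # Hypothetical names, shown only to demonstrate the call shape.
    kwargs, channels = build_training_config(
        "example-bucket", "tfrecords/", "arn:aws:iam::123456789012:role/ExampleRole"
    )
    print(channels)
```

With a channel named "training", SageMaker exposes the downloaded data directory to the script through the SM_CHANNEL_TRAINING environment variable. Whether train.py already reads its data path from a command-line argument or an environment variable is not stated in the question, so that wiring is left as an assumption here.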