A Machine Learning Specialist is designing a scalable data storage solution for Amazon Sa…

Question

A Machine Learning Specialist is designing a scalable data storage solution for Amazon SageMaker. There is an existing TensorFlow-based model implemented as a train.py script that relies on static training data that is currently stored as TFRecords.
Which method of providing training data to Amazon SageMaker would meet the business requirements with the LEAST development overhead?

Accepted Answer

Correct answer: B. B. Use Amazon SageMaker script mode and use train.py unchanged. Put the TFRecord data into an Amazon S3 bucket. Point the Amazon SageMaker training invocation to the S3 bucket without reformatting the training data. — Option B is correct because it allows the existing train.py script to be used unchanged while simply pointing to the S3 bucket, minimizing development overhead. Option A fails to meet the requirement since it does not utilize S3, and Options C and D involve significant modifications or preprocessing of the data, increasing development complexity.

AWS Certified Machine Learning – Specialty — Question 97

Answer options

Correct answer: B

Explanation