AWS Certified Machine Learning Engineer – Associate (MLA-C01) — Question 114

A company needs to use Amazon SageMaker to train a model on more than 300 GB of data. The training data is composed of files that are 200 MB in size. The data is stored in Amazon S3 Standard storage and feeds a dashboard tool.

Which SageMaker training ingestion mechanism is the MOST cost-effective solution for this scenario?

Answer options

Correct answer: D

Explanation

The correct answer is D because using Amazon S3 in fast file mode without S3 Express One Zone allows efficient access to the large dataset while minimizing costs. Options A and B involve additional costs due to the use of file systems that are generally more expensive than S3. Option C adds unnecessary costs by utilizing S3 Express One Zone, which is not needed in this scenario.