AWS Certified Machine Learning Engineer – Associate (MLA-C01) — Question 63

A company has an ML model that generates text descriptions based on images that customers upload to the company's website. The images can be up to 50 MB in total size.
An ML engineer decides to store the images in an Amazon S3 bucket. The ML engineer must implement a processing solution that can scale to accommodate changes in demand.
Which solution will meet these requirements with the LEAST operational overhead?

Answer options

Correct answer: B

Explanation

The correct answer is B. An Amazon SageMaker Asynchronous Inference endpoint queues incoming requests, reads each input from Amazon S3, and accepts payloads of up to 1 GB, so 50 MB images fit comfortably. The endpoint scales automatically with the size of the request backlog, which keeps operational overhead low. Option A is less suitable because batch processing does not adapt as well to fluctuating demand. Options C and D add complexity and operational overhead because they require additional infrastructure to manage.
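
As a rough illustration of how such an endpoint might be used, the Python sketch below builds a request for the `invoke_endpoint_async` call of the boto3 `sagemaker-runtime` client and a target-tracking scaling policy based on the `ApproximateBacklogSizePerInstance` CloudWatch metric. The endpoint name, bucket, key, target value, cooldowns, and all helper function names are hypothetical and are not part of the question; treat this as a sketch under those assumptions rather than a reference implementation.

```python
def build_async_request(endpoint_name, bucket, key, content_type="image/jpeg"):
    """Build keyword arguments for sagemaker-runtime invoke_endpoint_async.

    The image is not sent in the request body; the endpoint reads it from S3.
    """
    return {
        "EndpointName": endpoint_name,
        "InputLocation": f"s3://{bucket}/{key}",
        "ContentType": content_type,
    }


def submit_image(runtime_client, endpoint_name, bucket, key):
    """Queue one image for inference and return the S3 URI of the future result.

    runtime_client is expected to behave like boto3.client("sagemaker-runtime").
    """
    response = runtime_client.invoke_endpoint_async(
        **build_async_request(endpoint_name, bucket, key)
    )
    return response["OutputLocation"]


def build_backlog_scaling_policy(endpoint_name, target_backlog=5.0):
    """Build a target-tracking configuration keyed to the request backlog.

    The target value and cooldowns are illustrative assumptions, not tuned values.
    """
    return {
        "TargetValue": target_backlog,
        "CustomizedMetricSpecification": {
            "MetricName": "ApproximateBacklogSizePerInstance",
            "Namespace": "AWS/SageMaker",
            "Dimensions": [{"Name": "EndpointName", "Value": endpoint_name}],
            "Statistic": "Average",
        },
        "ScaleInCooldown": 300,
        "ScaleOutCooldown": 300,
    }


# Hypothetical usage (requires boto3 and AWS credentials):
#   import boto3
#   runtime = boto3.client("sagemaker-runtime")
#   output_uri = submit_image(runtime, "image-captioner", "customer-uploads", "img/cat.jpg")
```

The policy dictionary is meant to be passed as `TargetTrackingScalingPolicyConfiguration` to the Application Auto Scaling `put_scaling_policy` call, after the endpoint variant has been registered as a scalable target. Because the caller only passes an S3 location, the size of the image does not affect the size of the request itself.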