AWS Certified Solutions Architect – Associate (SAA-C02) — Question 387

A company is developing a new machine learning model solution in AWS. The models are developed as independent microservices that fetch about 1 GB of model data from Amazon S3 at startup and load the data into memory. Users access the models through an asynchronous API. Users can send a request or a batch of requests and specify where the results should be sent.
The company provides models to hundreds of users. The usage patterns for the models are irregular Some models could be unused for days or weeks. Other models could receive batches of thousands of requests at a time.
Which solution meets these requirements?

Answer options

Correct answer: D

Explanation

AWS Lambda is not suitable for this scenario because downloading and loading 1 GB of model data during startup would cause unacceptable latency during cold starts, ruling out options A and B. Amazon ECS is the correct environment for these heavyweight containers, and Amazon SQS is ideal for decoupling and handling the asynchronous, irregular batch requests. Option D is correct because AWS Auto Scaling can scale both the ECS tasks and the underlying container instances based on the SQS queue depth, whereas App Mesh in option C is a service mesh and cannot scale ECS cluster instances.