A company recently migrated to the AWS Cloud. The company wants a serverless solution for…

Question

A company recently migrated to the AWS Cloud. The company wants a serverless solution for large-scale parallel on-demand processing of a semistructured dataset. The data consists of logs, media files, sales transactions, and IoT sensor data that is stored in Amazon S3. The company wants the solution to process thousands of items in the dataset in parallel. Which solution will meet these requirements with the MOST operational efficiency?

Accepted Answer

Correct answer: B. B. Use the AWS Step Functions Map state in Distributed mode to process the data in parallel. — AWS Step Functions Map state in Distributed mode is optimized for high-concurrency, serverless processing of massive datasets in Amazon S3, allowing up to 10,000 parallel executions. In contrast, Inline mode is limited to only 40 concurrent iterations, which is inadequate for processing thousands of items in parallel. While AWS Glue and custom AWS Lambda orchestrations could process the data, they require significantly more operational and management overhead compared to the native Distributed Map state.

AWS Certified Solutions Architect – Associate (SAA-C03) — Question 985

Answer options

Correct answer: B

Explanation