AWS Certified Machine Learning Engineer – Associate (MLA-C01) — Question 30

A financial company receives a high volume of real-time market data streams from an external provider. The streams consist of thousands of JSON records every second.
The company needs to implement a scalable solution on AWS to identify anomalous data points.
Which solution will meet these requirements with the LEAST operational overhead?

Answer options

Correct answer: A

Explanation

Option A is the best choice because it leverages Amazon Kinesis data streams and the RANDOM_CUT_FOREST function in Amazon Managed Service for Apache Flink, which is specifically designed for real-time anomaly detection with minimal operational management. Options B and C involve additional components like SageMaker and Lambda, increasing complexity, while option D relies on batch processing with AWS Glue, which is not suited for real-time requirements.