Question

A financial company receives a high volume of real-time market data streams from an external provider. The streams consist of thousands of JSON records every second.
The company needs to implement a scalable solution on AWS to identify anomalous data points.
Which solution will meet these requirements with the LEAST operational overhead?

Accepted Answer

Correct answer: A. Ingest real-time data into Amazon Kinesis data streams. Use the built-in RANDOM_CUT_FOREST function in Amazon Managed Service for Apache Flink to process the data streams and to detect data anomalies.

Option A is the best choice because both services are managed and the built-in RANDOM_CUT_FOREST function scores records for anomalies as they arrive, so there is no separate model to train, host, or maintain. Options B and C add components such as SageMaker and Lambda, which increases complexity and operational overhead. Option D relies on batch processing with AWS Glue, which does not suit real-time requirements.
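To make the idea of scoring each record as it arrives more concrete, here is a minimal local sketch in Python. It is not Random Cut Forest and does not call any AWS service: it swaps in a much simpler rolling z-score over a sliding window, purely to illustrate per-record anomaly scoring on a JSON stream. The record shape (a "symbol" and a "price" field), the function names, and the window and threshold values are all assumptions made for illustration.

```python
import json
import math
from collections import deque


def detect_anomalies(json_lines, window_size=20, min_samples=10, threshold=3.0):
    """Yield (record, z_score, is_anomaly) for each JSON record in arrival order.

    Rolling z-score stand-in for illustration only; this is NOT Random Cut Forest.
    """
    window = deque(maxlen=window_size)  # keeps only the most recent prices
    for line in json_lines:
        record = json.loads(line)
        price = float(record["price"])  # "price" is an assumed field name
        if len(window) < min_samples:
            z_score = 0.0  # not enough history yet, so nothing is flagged
        else:
            mean = sum(window) / len(window)
            std = math.sqrt(sum((p - mean) ** 2 for p in window) / len(window))
            if std > 0:
                z_score = abs(price - mean) / std
            else:
                # A flat window: any deviation at all is infinitely unusual.
                z_score = 0.0 if price == mean else math.inf
        window.append(price)  # the new price becomes part of the history
        yield record, z_score, z_score > threshold


def sample_stream():
    """Build 32 hypothetical records: a steady price with one spike at index 30."""
    prices = [100.0 + (i % 5) * 0.1 for i in range(30)] + [250.0, 100.2]
    return [json.dumps({"symbol": "ABC", "price": p}) for p in prices]


if __name__ == "__main__":
    for i, (record, z, flagged) in enumerate(detect_anomalies(sample_stream())):
        if flagged:
            print(i, record, round(z, 1))
```

Even a stand-in this small has to manage its own window state and thresholds, which is the kind of work the managed option in answer A is meant to take off the team's hands.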

AWS Certified Machine Learning Engineer – Associate (MLA-C01) — Question 30
