AWS Certified Machine Learning – Specialty — Question 51
A Data Scientist wants to gain real-time insights into a data stream of GZIP files.
Which solution would allow the use of SQL to query the stream with the LEAST latency?
Answer options
- A. Amazon Kinesis Data Analytics with an AWS Lambda function to transform the data.
- B. AWS Glue with a custom ETL script to transform the data.
- C. An Amazon Kinesis Client Library to transform the data and save it to an Amazon ES cluster.
- D. Amazon Kinesis Data Firehose to transform the data and put it into an Amazon S3 bucket.
Correct answer: A
Explanation
The correct answer is A because Amazon Kinesis Data Analytics is specifically designed for real-time data processing and allows SQL queries with minimal latency. Options B, C, and D involve additional layers of processing or storage that would introduce higher latency compared to direct querying with Kinesis Data Analytics.