AWS Certified Solutions Architect – Associate (SAA-C03) — Question 100
A company hosts more than 300 global websites and applications. The company requires a platform to analyze more than 30 TB of clickstream data each day.
What should a solutions architect do to transmit and process the clickstream data?
Answer options
- A. Design an AWS Data Pipeline to archive the data to an Amazon S3 bucket and run an Amazon EMR cluster with the data to generate analytics.
- B. Create an Auto Scaling group of Amazon EC2 instances to process the data and send it to an Amazon S3 data lake for Amazon Redshift to use for analysis.
- C. Cache the data to Amazon CloudFront. Store the data in an Amazon S3 bucket. When an object is added to the S3 bucket, run an AWS Lambda function to process the data for analysis.
- D. Collect the data from Amazon Kinesis Data Streams. Use Amazon Kinesis Data Firehose to transmit the data to an Amazon S3 data lake. Load the data in Amazon Redshift for analysis.
Correct answer: D
Explanation
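D is correct because it combines services built for high-volume streaming ingestion. Amazon Kinesis Data Streams collects clickstream events from the websites and applications as they occur, Amazon Kinesis Data Firehose delivers them to an Amazon S3 data lake, and Amazon Redshift loads the data from S3 for analysis. Option A relies on AWS Data Pipeline, which orchestrates scheduled batch data movement and does not provide a way to collect a continuous stream of events from hundreds of sources. Option B offers no managed ingestion layer, so the company would have to build and operate its own collection and processing fleet on Amazon EC2. Option C misuses Amazon CloudFront, which is a content delivery network rather than a data ingestion service, and running an AWS Lambda function for each new S3 object is a poor fit for more than 30 TB of data each day.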
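To put the 30 TB per day figure in perspective, the following is a rough sizing sketch rather than a definitive capacity plan. It converts the daily volume into an average ingest rate and estimates how many Kinesis Data Streams shards that rate would need. It assumes decimal units (1 TB = 10^12 bytes), traffic spread evenly over the day, and a write limit of about 1 MB per second per shard in provisioned mode. The function names and the `peak_factor` parameter are hypothetical and exist only for this illustration.

```python
import math

BYTES_PER_TB = 10**12            # decimal terabyte (assumption)
BYTES_PER_MB = 10**6             # decimal megabyte (assumption)
SECONDS_PER_DAY = 24 * 60 * 60

# Assumed per-shard write limit for a provisioned-mode stream (about 1 MB/s).
SHARD_WRITE_BYTES_PER_SEC = 1 * BYTES_PER_MB


def average_ingest_mb_per_sec(tb_per_day: float) -> float:
    """Average ingest rate in MB/s if traffic were spread evenly over a day."""
    return tb_per_day * BYTES_PER_TB / SECONDS_PER_DAY / BYTES_PER_MB


def min_shards(tb_per_day: float, peak_factor: float = 1.0) -> int:
    """Minimum shard count for the given daily volume.

    peak_factor is a hypothetical multiplier for traffic peaks,
    for example 2.0 if the busiest second carries twice the average load.
    """
    bytes_per_sec = tb_per_day * BYTES_PER_TB / SECONDS_PER_DAY * peak_factor
    return math.ceil(bytes_per_sec / SHARD_WRITE_BYTES_PER_SEC)


if __name__ == "__main__":
    print(f"average ingest: {average_ingest_mb_per_sec(30):.1f} MB/s")
    print(f"shards at average load: {min_shards(30)}")
    print(f"shards at 2x peak: {min_shards(30, peak_factor=2.0)}")
```

Under these assumptions the average rate is roughly 347 MB per second, which is why a managed, horizontally scalable streaming service fits the requirement better than a self-managed fleet or per-object processing. Real clickstream traffic is rarely uniform, so an actual design would size for peaks or let the service scale capacity on demand.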