AWS Certified Solutions Architect – Professional — Question 827

A company receives clickstream data files to Amazon S3 every five minutes. A Python script runs as a cron job once a day on an Amazon EC2 instance to process each file and load it into a database hosted on Amazon RDS. The cron job takes 15 to 30 minutes to process 24 hours of data. The data consumers ask for the data be available as soon as possible.
Which solution would accomplish the desired outcome?

Answer options

Correct answer: D

Explanation

Using AWS Lambda triggered by Amazon S3 event notifications allows the system to process clickstream data in near real-time as soon as each file arrives, meeting the requirement for immediate availability. Options A, B, and C still introduce unnecessary delays (hourly or daily) and rely on scheduling instead of event-driven execution. Furthermore, Option D eliminates the overhead of managing Amazon EC2 instances.