A company needs to store objects containing log data in JSON format. The objects are gene…

Question

A company needs to store objects containing log data in JSON format. The objects are generated by eight applications running in AWS. Six of the applications generate a total of 500 KiB of data per second, and two of the applications can generate up to 2 MiB of data per second.
A data engineer wants to implement a scalable solution to capture and store usage data in an Amazon S3 bucket. The usage data objects need to be reformatted, converted to .csv format, and then compressed before they are stored in Amazon S3. The company requires the solution to include the least custom code possible and has authorized the data engineer to request a service quota increase if needed.
Which solution meets these requirements?

Accepted Answer

Correct answer: A. A. Configure an Amazon Kinesis Data Firehose delivery stream for each application. Write AWS Lambda functions to read log data objects from the stream for each application. Have the function perform reformatting and .csv conversion. Enable compression on all the delivery streams. — Option A is correct because it utilizes Amazon Kinesis Data Firehose, which is designed for real-time data processing with minimal custom code. It allows for automatic formatting, conversion to .csv, and compression in a single stream configuration for each application. The other options either require more custom coding (B and C) or do not meet the requirement for minimal custom code (D).

AWS Certified Data Analytics – Specialty — Question 68

Answer options

Correct answer: A

Explanation