AWS Certified Solutions Architect – Associate (SAA-C03) — Question 772
A company's application runs on Amazon EC2 instances that are in multiple Availability Zones. The application needs to ingest real-time data from third-party applications.
The company needs a data ingestion solution that places the ingested raw data in an Amazon S3 bucket.
Which solution will meet these requirements?
Answer options
- A. Create Amazon Kinesis data streams for data ingestion. Create Amazon Kinesis Data Firehose delivery streams to consume the Kinesis data streams. Specify the S3 bucket as the destination of the delivery streams.
- B. Create database migration tasks in AWS Database Migration Service (AWS DMS). Specify replication instances of the EC2 instances as the source endpoints. Specify the S3 bucket as the target endpoint. Set the migration type to migrate existing data and replicate ongoing changes.
- C. Create and configure AWS DataSync agents on the EC2 instances. Configure DataSync tasks to transfer data from the EC2 instances to the S3 bucket.
- D. Create an AWS Direct Connect connection to the application for data ingestion. Create Amazon Kinesis Data Firehose delivery streams to consume direct PUT operations from the application. Specify the S3 bucket as the destination of the delivery streams.
Correct answer: A
Explanation
Amazon Kinesis Data Streams combined with Amazon Kinesis Data Firehose is the standard, highly scalable AWS architecture for ingesting real-time streaming data and delivering it directly to Amazon S3. AWS DMS and AWS DataSync are designed for database migrations and bulk file transfers, respectively, rather than real-time application stream ingestion. AWS Direct Connect is a private network connectivity service and is not suited for ingesting real-time third-party data over the public internet.