AWS Certified Data Analytics – Specialty — Question 35

A company is building a data lake and needs to ingest data from a relational database that has time-series data. The company wants to use managed services to accomplish this. The process needs to be scheduled daily and bring incremental data only from the source into Amazon S3.
What is the MOST cost-effective approach to meet these requirements?

Answer options

Correct answer: A

Explanation

Option A is the most cost-effective choice because it leverages job bookmarks in AWS Glue to efficiently track and ingest only the incremental changes. Option B introduces additional complexity and potential costs by involving DynamoDB. Option C requires processing the entire dataset, which is inefficient and more expensive. Similarly, option D unnecessarily ingests all data before filtering, making it less cost-effective than option A.