A company wants to create a data repository in the AWS Cloud for machine learning (ML) pr…

Question

A company wants to create a data repository in the AWS Cloud for machine learning (ML) projects. The company wants to use AWS to perform complete ML lifecycles and wants to use Amazon S3 for the data storage. All of the company's data currently resides on premises and is 40 ׀¢׀’ in size.
The company wants a solution that can transfer and automatically update data between the on-premises object storage and Amazon S3. The solution must support encryption, scheduling, monitoring, and data integrity validation.
Which solution meets these requirements?

Accepted Answer

Correct answer: C. C. Use AWS DataSync to make an initial copy of the entire dataset. Schedule subsequent incremental transfers of changing data until the final cutover from on premises to AWS. — The correct answer is C because AWS DataSync is specifically designed to transfer large amounts of data efficiently and supports features such as scheduling, encryption, and data integrity validation. Options A and D do not provide the necessary automation or support for ongoing data synchronization, while option B does not address the requirement for automatic updates.

AWS Certified Machine Learning – Specialty — Question 176

Answer options

Correct answer: C

Explanation