AWS Certified Machine Learning Engineer – Associate (MLA-C01) — Question 156
A company uses an NFS-based data store to store data for ML training. Linux-based systems access the data store.
The company needs a hybrid system to make the shared data store accessible to on-premises servers and Amazon SageMaker AI notebooks that will consume the data. File locking is required for the data producers.
Which AWS storage solution will meet these requirements?
Answer options
- A. Use an Amazon S3 bucket to store the data. Use Mountpoint for Amazon S3 to mount the S3 bucket to the on-premises servers and the SageMaker AI notebooks.
- B. Use an Amazon Elastic File System (Amazon EFS) file system to store the data. Mount the file system to the on-premises servers and the SageMaker AI notebooks.
- C. Use an Amazon FSx for Lustre file system to store the data. Mount the file system to the on-premises servers and the SageMaker AI notebooks.
- D. Use an Amazon Elastic Block Store (Amazon EBS) volume to store the data. Mount the volume to the on-premises servers and the SageMaker AI notebooks.
Correct answer: B
Explanation
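The correct answer is B. Amazon EFS is a fully managed, shared file system that Linux clients mount over NFS (NFSv4.0 and NFSv4.1), so it matches the company's existing NFS-based workflow. It supports NFSv4 advisory file locking, which the data producers require. On-premises servers can mount it over AWS Direct Connect or a VPN connection, and SageMaker AI notebooks can mount the same file system from inside the VPC. Option A fails because Mountpoint for Amazon S3 does not support file locking and is not a full POSIX file system. Option D fails because an EBS volume is block storage that attaches to Amazon EC2 instances within a single Availability Zone; it cannot be mounted by on-premises servers and is not a shared file system. Option C does provide shared access, but FSx for Lustre uses the Lustre protocol rather than NFS, requires the Lustre client on every system that mounts it, and is aimed at high-performance computing workloads rather than at replacing an NFS data store.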
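To make the file-locking requirement concrete, the following is a minimal sketch of how a data producer on Linux might take an exclusive advisory lock before appending to a shared file. It assumes the EFS file system is already mounted over NFSv4 and that the kernel NFS client forwards POSIX locks to the server. The function name `append_with_lock` and the file name `train.csv` are illustrative and do not come from the question; the demo writes to a temporary directory so that it runs without any mount.

```python
import fcntl
import os
import tempfile


def append_with_lock(path, record):
    """Append one record while holding an exclusive advisory lock."""
    with open(path, "a") as f:
        # Blocks until no other cooperating process holds the lock.
        fcntl.lockf(f, fcntl.LOCK_EX)
        try:
            f.write(record + "\n")
            f.flush()
            os.fsync(f.fileno())  # push the data to storage before unlocking
        finally:
            fcntl.lockf(f, fcntl.LOCK_UN)


if __name__ == "__main__":
    # Assumption: on a real producer this would be a directory on the
    # mounted shared file system rather than a temporary directory.
    with tempfile.TemporaryDirectory() as data_dir:
        target = os.path.join(data_dir, "train.csv")
        append_with_lock(target, "id,label")
        append_with_lock(target, "1,cat")
        with open(target) as f:
            print(f.read(), end="")
```

The lock is advisory: it only protects the file if every producer acquires it before writing. Readers such as notebooks can take a shared lock with `fcntl.LOCK_SH` in the same way if they need a consistent view.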