AWS Certified Solutions Architect – Professional (SAP-C02) — Question 338

A team of data scientists is using Amazon SageMaker instances and SageMaker APIs to train machine learning (ML) models. The SageMaker instances are deployed in a VPC that does not have access to or from the internet. Datasets for ML model training are stored in an Amazon S3 bucket. Interface VPC endpoints provide access to Amazon S3 and the SageMaker APIs.

Occasionally, the data scientists require access to the Python Package Index (PyPI) repository to update Python packages that they use as part of their workflow. A solutions architect must provide access to the PyPI repository while ensuring that the SageMaker instances remain isolated from the internet.

Which solution will meet these requirements?

Answer options

Correct answer: D

Explanation

AWS CodeArtifact allows organizations to securely store and share software packages, and it supports external connections to public repositories like PyPI. By configuring a VPC endpoint for CodeArtifact, SageMaker instances can securely download Python dependencies within the private network without any internet exposure. Using NAT gateways or NAT instances would introduce direct paths to the internet, which violates the strict isolation requirement.