AWS Certified Machine Learning – Specialty — Question 192

A company is using Amazon SageMaker to build a machine learning (ML) model to predict customer churn based on customer call transcripts. Audio files from customer calls are located in an on-premises VoIP system that has petabytes of recorded calls. The on-premises infrastructure has high-velocity networking and connects to the company's AWS infrastructure through a VPN connection over a 100 Mbps connection.

The company has an algorithm for transcribing customer calls that requires GPUs for inference. The company wants to store these transcriptions in an Amazon S3 bucket in the AWS Cloud for model development.

Which solution should an ML specialist use to deliver the transcriptions to the S3 bucket as quickly as possible?

Answer options

Correct answer: A

Explanation

The correct answer, A, is effective because using an AWS Snowball Edge Compute Optimized device with an NVIDIA Tesla module allows for efficient processing of large datasets on-premises before transferring the results to S3. Options B and C do not provide the same level of GPU support or optimal speed for transcription, while option D introduces unnecessary delays by relying on AWS Lambda after files are uploaded to S3.