A company is using Amazon SageMaker to build a machine learning (ML) model to predict cus…

Question

A company is using Amazon SageMaker to build a machine learning (ML) model to predict customer churn based on customer call transcripts. Audio files from customer calls are located in an on-premises VoIP system that has petabytes of recorded calls. The on-premises infrastructure has high-velocity networking and connects to the company's AWS infrastructure through a VPN connection over a 100 Mbps connection. The company has an algorithm for transcribing customer calls that requires GPUs for inference. The company wants to store these transcriptions in an Amazon S3 bucket in the AWS Cloud for model development. Which solution should an ML specialist use to deliver the transcriptions to the S3 bucket as quickly as possible?

Accepted Answer

Correct answer: A. A. Order and use an AWS Snowball Edge Compute Optimized device with an NVIDIA Tesla module to run the transcription algorithm. Use AWS DataSync to send the resulting transcriptions to the transcription S3 bucket. — The correct answer, A, is effective because using an AWS Snowball Edge Compute Optimized device with an NVIDIA Tesla module allows for efficient processing of large datasets on-premises before transferring the results to S3. Options B and C do not provide the same level of GPU support or optimal speed for transcription, while option D introduces unnecessary delays by relying on AWS Lambda after files are uploaded to S3.

AWS Certified Machine Learning – Specialty — Question 192

Answer options

Correct answer: A

Explanation