Google Cloud Professional Machine Learning Engineer — Question 270

You work at an organization that maintains a cloud-based communication platform that integrates conventional chat, voice, and video conferencing into one platform. The audio recordings are stored in Cloud Storage. All recordings have an 8 kHz sample rate and are more than one minute long. You need to implement a new feature in the platform that will automatically transcribe voice call recordings into a text for future applications, such as call summarization and sentiment analysis. How should you implement the voice call transcription feature following Google-recommended best practices?

Answer options

Correct answer: B

Explanation

The correct answer is B because using asynchronous recognition with the original sampling rate is recommended for longer audio files, allowing for better performance and handling of larger workloads. Option A is incorrect as synchronous recognition is not ideal for lengthy recordings, while options C and D suggest unnecessary upsampling of the audio which is not needed in this context.