Google Cloud Professional Machine Learning Engineer — Question 227

You are using Keras and TensorFlow to develop a fraud detection model. Records of customer transactions are stored in a large table in BigQuery. You need to preprocess these records in a cost-effective and efficient way before you use them to train the model. The trained model will be used to perform batch inference in BigQuery. How should you implement the preprocessing workflow?

Answer options

Correct answer: C

Explanation

Option C is correct because preprocessing the records in BigQuery with SQL keeps the work where the transaction table is already stored and where batch inference will later run. BigQuery is designed to process large tables, so the data does not have to be exported or moved before training. The other options add extra steps and external services, which increase complexity and cost for this scenario.
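
As a rough illustration of what SQL preprocessing in BigQuery might look like, the sketch below builds a single statement that writes a cleaned feature table. The table names, the column names (transaction_id, amount, transaction_ts, merchant_category, is_fraud), and the specific transformations are hypothetical and are not part of the question; amounts are assumed to be non-negative. The helper run_preprocessing assumes the google-cloud-bigquery client library is installed and credentials are configured.

```python
def build_preprocessing_sql(source_table: str, destination_table: str) -> str:
    """Return a SQL statement that writes preprocessed rows to a new table.

    All column names here are hypothetical placeholders for illustration.
    """
    return f"""
CREATE OR REPLACE TABLE `{destination_table}` AS
SELECT
  transaction_id,
  LN(1 + amount) AS log_amount,                      -- compress the skewed amount range
  EXTRACT(HOUR FROM transaction_ts) AS hour_of_day,  -- simple time-based feature
  IFNULL(merchant_category, 'unknown') AS merchant_category,  -- fill missing categories
  is_fraud
FROM `{source_table}`
WHERE amount IS NOT NULL
""".strip()


def run_preprocessing(sql: str) -> None:
    """Run the statement in BigQuery (needs google-cloud-bigquery and credentials)."""
    # Imported lazily so the SQL builder above works without the client library.
    from google.cloud import bigquery

    client = bigquery.Client()
    client.query(sql).result()  # block until the query job finishes


if __name__ == "__main__":
    # Hypothetical table identifiers; this only prints the SQL, it does not run it.
    print(build_preprocessing_sql(
        "my-project.fraud.transactions",
        "my-project.fraud.transactions_preprocessed",
    ))
```

Because the transformations live in SQL, the same SELECT logic could in principle be reused on new records before batch inference, which helps keep training and serving features consistent.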