You need to use TensorFlow to train an image classification model. Your dataset is locate…

Question

You need to use TensorFlow to train an image classification model. Your dataset is located in a Cloud Storage directory and contains millions of labeled images. Before training the model, you need to prepare the data. You want the data preprocessing and model training workflow to be as efficient, scalable, and low maintenance as possible. What should you do?

Accepted Answer

Correct answer: A. A. 1. Create a Dataflow job that creates sharded TFRecord files in a Cloud Storage directory.
2. Reference tf.data.TFRecordDataset in the training script.
3. Train the model by using Vertex AI Training with a V100 GPU. — Option A is the correct choice because it effectively utilizes a Dataflow job to create sharded TFRecord files, which is optimal for large datasets. Options B, C, and D involve unnecessary steps or less efficient methods for organizing data, which can complicate the workflow and increase maintenance efforts.

Google Cloud Professional Machine Learning Engineer — Question 228

Answer options

Correct answer: A

Explanation