Google Cloud Professional Machine Learning Engineer — Question 15

You have been asked to develop an input pipeline for an ML training model that processes images from disparate sources at a low latency. You discover that your input data does not fit in memory. How should you create a dataset following Google-recommended best practices?

Answer options

Correct answer: D

Explanation

The correct answer is D because converting images into TFRecords and storing them in Cloud Storage allows for efficient data handling and retrieval, which is crucial for large datasets that cannot fit into memory. Options A, B, and C do not address the issue of memory constraints effectively and are not aligned with best practices for handling large-scale input data.