Google Cloud Professional Machine Learning Engineer — Question 297
You lead a data science team that is working on a computationally intensive project involving running several experiments. Your team is geographically distributed and requires a platform that provides the most effective real-time collaboration and rapid experimentation. You plan to add GPUs to speed up your experimentation cycle, and you want to avoid having to manually set up the infrastructure. You want to use the Google-recommended approach. What should you do?
Answer options
- A. Configure a managed Dataproc cluster for large-scale data processing. Configure individual Jupyter notebooks on VMs that each team member uses for experimentation and model development.
- B. Use Colab Enterprise with Cloud Storage for data management. Use a Git repository for version control.
- C. Use Vertex AI Workbench and Cloud Storage for data management. Use a Git repository for version control.
- D. Configure a distributed JupyterLab instance that each team member can access on a Compute Engine VM. Use a shared code repository for version control.
Correct answer: B
Explanation
The correct answer, B, suggests using Colab Enterprise with Cloud Storage, which is optimal for real-time collaboration and supports rapid experimentation with minimal infrastructure setup. Options A and D involve manual setups that do not align with the need for automation and collaboration, while option C, while effective, does not provide the same level of collaboration features as Colab Enterprise.