Google Cloud Professional Data Engineer — Question 15
Your company has hired a new data scientist who wants to perform complicated analyses across very large datasets stored in Google Cloud Storage and in a Cassandra cluster on Google Compute Engine. The scientist primarily wants to create labelled data sets for machine learning projects, along with some visualization tasks. She reports that her laptop is not powerful enough to perform her tasks and it is slowing her down. You want to help her perform her tasks.
What should you do?
Answer options
- A. Run a local version of Jupyter on the laptop.
- B. Grant the user access to Google Cloud Shell.
- C. Host a visualization tool on a VM on Google Compute Engine.
- D. Deploy Google Cloud Datalab to a virtual machine (VM) on Google Compute Engine.
Correct answer: D
Explanation
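The correct answer is D. Google Cloud Datalab is an interactive notebook environment built on Jupyter that runs on a Compute Engine VM, so the analysis uses cloud compute rather than the laptop, and the VM can be sized to fit the workload. It supports both data exploration and visualization, which covers the scientist's need to build labelled data sets and produce charts. Option A leaves the work on the underpowered laptop, so it does not address the bottleneck. Option B offers only a small temporary VM intended for command-line administration, not for heavy analysis. Option C moves visualization to the cloud but does nothing for the analysis and labelling work that is her primary task.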