Google Cloud Professional Cloud Developer — Question 59
Your company is planning to migrate its on-premises Hadoop environment to the cloud. The rising cost of storing and maintaining data in HDFS is a major concern for your company. You also want to make minimal changes to existing data analytics jobs and the existing architecture.
How should you proceed with the migration?
Answer options
- A. Migrate your data stored in Hadoop to BigQuery. Change your jobs to source their information from BigQuery instead of the on-premises Hadoop environment.
- B. Create Compute Engine instances with HDD instead of SSD to save costs. Then perform a full migration of your existing environment into the new one in Compute Engine instances.
- C. Create a Cloud Dataproc cluster on Google Cloud Platform, and then migrate your Hadoop environment to the new Cloud Dataproc cluster. Move your HDFS data into larger HDD disks to save on storage costs.
- D. Create a Cloud Dataproc cluster on Google Cloud Platform, and then migrate your Hadoop code objects to the new cluster. Move your data to Cloud Storage and leverage the Cloud Dataproc connector to run jobs on that data.
Correct answer: D
Explanation
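The correct answer is D because it addresses both concerns at once. Cloud Dataproc is a managed Hadoop and Spark service, so existing jobs can run with minimal changes, and the Cloud Storage connector lets those jobs read and write gs:// paths in place of hdfs:// paths. Storing the data in Cloud Storage rather than HDFS removes the need to maintain HDFS, typically lowers storage cost, and decouples the data from the cluster, so clusters can be resized or deleted without losing data. Option A requires rewriting the existing jobs to work against BigQuery, which conflicts with the goal of minimal changes. Option B is a lift-and-shift that leaves you managing the Hadoop cluster and HDFS yourself on Compute Engine, so the storage and maintenance concern remains. Option C moves to Dataproc but keeps the data in HDFS on persistent disks, so it does not take advantage of Cloud Storage and does not fully address the storage cost concern.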
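To illustrate why option D needs only small job changes, here is a minimal sketch of the path rewrite involved: an hdfs:// input or output location becomes a gs:// location with the same directory layout. The function name `to_gcs_uri`, the bucket name `example-bucket`, and the sample NameNode address are hypothetical and chosen for illustration only; a real migration would also need to copy the data and review any job settings that assume HDFS.

```python
from urllib.parse import urlparse


def to_gcs_uri(hdfs_uri: str, bucket: str) -> str:
    """Map an hdfs:// URI to a gs:// URI with the same path.

    Hypothetical helper for illustration: `bucket` is an assumed
    Cloud Storage bucket that mirrors the HDFS directory layout.
    """
    parsed = urlparse(hdfs_uri)
    if parsed.scheme != "hdfs":
        raise ValueError(f"expected an hdfs:// URI, got: {hdfs_uri}")
    # Drop the NameNode host and port; keep only the file path.
    return f"gs://{bucket}{parsed.path}"


if __name__ == "__main__":
    # Sample path and bucket are made up for this example.
    print(to_gcs_uri("hdfs://namenode:8020/warehouse/events/2024", "example-bucket"))
```

With the Cloud Storage connector available on the Dataproc cluster, a job that previously read the HDFS path would be pointed at the resulting gs:// path instead, leaving the rest of the job logic unchanged.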