Google Cloud Professional Data Engineer — Question 176
You have one BigQuery dataset which includes customers’ street addresses. You want to retrieve all occurrences of street addresses from the dataset. What should you do?
Answer options
- A. Write a SQL query in BigQuery by using REGEXP_CONTAINS on all tables in your dataset to find rows where the word “street” appears.
- B. Create a deep inspection job on each table in your dataset with Cloud Data Loss Prevention and create an inspection template that includes the STREET_ADDRESS infoType.
- C. Create a discovery scan configuration on your organization with Cloud Data Loss Prevention and create an inspection template that includes the STREET_ADDRESS infoType.
- D. Create a de-identification job in Cloud Data Loss Prevention and use the masking transformation.
Correct answer: B
Explanation
The correct answer is B because creating a deep inspection job with Cloud Data Loss Prevention allows you to specifically identify sensitive information types like STREET_ADDRESS across all tables in the dataset. Option A is incorrect as it only searches for the word 'street' and may miss variations. Option C is not suitable because a discovery scan is for organization-wide data discovery rather than specific dataset extraction. Option D focuses on de-identification, which does not retrieve the addresses but rather masks them.