Google Cloud Associate Data Practitioner — Question 45

You need to design a data pipeline that ingests data from CSV, Avro, and Parquet files into Cloud Storage. The data includes raw user input. You need to remove all malicious SQL injections before storing the data in BigQuery. Which data manipulation methodology should you choose?

Answer options

Correct answer: C

Explanation

The correct answer is ETL because it involves extracting data, transforming it to cleanse and secure it (including removing SQL injections), and then loading it into the target storage, which is BigQuery in this case. ELT, on the other hand, loads the data first and then transforms it, which would not adequately address the need to remove malicious content before storing it. EL and ETLT do not fit the requirement for data transformation prior to loading.