AWS Certified Solutions Architect – Associate (SAA-C03) — Question 579
A research company uses on-premises devices to generate data for analysis. The company wants to use the AWS Cloud to analyze the data. The devices generate .csv files and support writing the data to an SMB file share. Company analysts must be able to use SQL commands to query the data. The analysts will run queries periodically throughout the day.
Which combination of steps will meet these requirements MOST cost-effectively? (Choose three.)
Answer options
- A. Deploy an AWS Storage Gateway on premises in Amazon S3 File Gateway mode.
- B. Deploy an AWS Storage Gateway on premises in Amazon FSx File Gateway mode.
- C. Set up an AWS Glue crawler to create a table based on the data that is in Amazon S3.
- D. Set up an Amazon EMR cluster with EMR File System (EMRFS) to query the data that is in Amazon S3. Provide access to analysts.
- E. Set up an Amazon Redshift cluster to query the data that is in Amazon S3. Provide access to analysts.
- F. Set up Amazon Athena to query the data that is in Amazon S3. Provide access to analysts.
Correct answer: A, C, F
Explanation
Amazon S3 File Gateway (Option A) exposes an SMB file share on premises and stores each file the devices write as an object in Amazon S3, so it is the right ingestion path. Amazon FSx File Gateway (Option B) instead fronts an Amazon FSx for Windows File Server file system: the data would not land in Amazon S3, where the remaining options expect it, and the FSx file system adds cost. An AWS Glue crawler (Option C) scans the .csv files in Amazon S3, infers their schema, and creates a table definition in the AWS Glue Data Catalog. Amazon Athena (Option F) then queries that table with standard SQL. Athena is serverless and bills per query based on the data scanned, which suits queries that run only periodically. Amazon EMR (Option D) and Amazon Redshift (Option E) both require a provisioned cluster that accrues cost for as long as it runs, including the idle time between queries.
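The crawler and query steps (Options C and F) can be sketched with boto3. This is a minimal illustration and is not part of the exam material: the bucket name, IAM role ARN, database name, and table name are all hypothetical placeholders, and the sketch assumes the S3 File Gateway share already writes into the bucket. The request builders are plain functions so the parameters can be inspected without AWS credentials.

```python
def crawler_request(name, role_arn, database, s3_path):
    """Build keyword arguments for glue.create_crawler (Option C)."""
    return {
        "Name": name,
        "Role": role_arn,
        "DatabaseName": database,
        "Targets": {"S3Targets": [{"Path": s3_path}]},
    }


def athena_request(sql, database, output_location):
    """Build keyword arguments for athena.start_query_execution (Option F)."""
    return {
        "QueryString": sql,
        "QueryExecutionContext": {"Database": database},
        "ResultConfiguration": {"OutputLocation": output_location},
    }


# Hypothetical names, chosen only for illustration.
CRAWLER = crawler_request(
    name="device-csv-crawler",
    role_arn="arn:aws:iam::123456789012:role/ExampleGlueCrawlerRole",
    database="research",
    s3_path="s3://example-research-data/device_csv/",
)
QUERY = athena_request(
    sql="SELECT * FROM device_csv LIMIT 10",  # table name is an assumption
    database="research",
    output_location="s3://example-research-data/athena-results/",
)


def run_pipeline(region="us-east-1"):
    """Create and start the crawler, then submit the Athena query.

    Requires boto3 and AWS credentials. A real script would wait for
    the crawler to finish before querying; that wait is omitted here.
    """
    import boto3  # imported lazily so the builders above work without it

    glue = boto3.client("glue", region_name=region)
    athena = boto3.client("athena", region_name=region)
    glue.create_crawler(**CRAWLER)
    glue.start_crawler(Name=CRAWLER["Name"])
    response = athena.start_query_execution(**QUERY)
    return response["QueryExecutionId"]
```

Nothing in this flow provisions a cluster: the crawler runs on demand and Athena charges per query, which is the cost argument behind the A, C, F combination.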