AWS Certified Solutions Architect – Associate (SAA-C03) — Question 787
A company has 5 TB of datasets. The datasets consist of 1 million user profiles and 10 million connections. The user profiles have connections as many-to-many relationships. The company needs a performance efficient way to find mutual connections up to five levels.
Which solution will meet these requirements?
Answer options
- A. Use an Amazon S3 bucket to store the datasets. Use Amazon Athena to perform SQL JOIN queries to find connections.
- B. Use Amazon Neptune to store the datasets with edges and vertices. Query the data to find connections.
- C. Use an Amazon S3 bucket to store the datasets. Use Amazon QuickSight to visualize connections.
- D. Use Amazon RDS to store the datasets with multiple tables. Perform SQL JOIN queries to find connections.
Correct answer: B
Explanation
Amazon Neptune is a purpose-built graph database optimized for storing and querying highly connected datasets using vertices and edges, which makes it highly efficient for traversing many-to-many relationships up to five levels deep. Traditional relational databases (Amazon RDS) and query engines (Amazon Athena) struggle with performance when executing recursive or deeply nested SQL JOINs for multi-level relationship traversal. Amazon QuickSight is a visualization tool and lacks the querying capability to compute these complex connection paths.