AWS Certified Big Data – Specialty — Question 11
An administrator needs to design a distribution strategy for a star schema in a Redshift cluster. The administrator needs to determine the optimal distribution style for the tables in the Redshift schema.
In which three circumstances would choosing Key-based distribution be most appropriate? (Select three.)
Answer options
- A. When the administrator needs to optimize a large, slowly changing dimension table.
- B. When the administrator needs to reduce cross-node traffic.
- C. When the administrator needs to optimize the fact table for parity with the number of slices.
- D. When the administrator needs to balance data distribution and data collocation.
- E. When the administrator needs to take advantage of data locality on a local node for joins and aggregates.
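Correct answer: B, D, E
Explanation
Key-based distribution places rows that share a distribution-key value on the same slice. When the fact table and a dimension table are distributed on their common join column, matching rows are collocated. This reduces cross-node traffic during joins (B) and lets joins and aggregates run against data that is local to each node (E). With a key that has many distinct, evenly spread values, it also balances even data distribution against collocation (D). Option A better matches ALL distribution, which suits tables that are updated infrequently, and option C describes EVEN distribution, which spreads rows round-robin across the slices regardless of their values.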
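The collocation effect of key-based distribution can be illustrated with a toy simulation. The sketch below is not Redshift code: the slice count, the table contents, the function names, and the use of CRC32 as the hash are all assumptions made for illustration, and Redshift's internal hash function is different. It counts how many fact rows sit on a different slice from their matching dimension row under key-based and round-robin (EVEN) placement.

```python
import zlib
from collections import defaultdict

NUM_SLICES = 4  # assumed slice count, for illustration only


def slice_for_key(key, num_slices=NUM_SLICES):
    """Toy stand-in for a distribution hash (not Redshift's real function)."""
    return zlib.crc32(str(key).encode()) % num_slices


def distribute_by_key(rows, key_column, num_slices=NUM_SLICES):
    """Place each row on the slice chosen by hashing its key column."""
    slices = defaultdict(list)
    for row in rows:
        slices[slice_for_key(row[key_column], num_slices)].append(row)
    return slices


def distribute_evenly(rows, num_slices=NUM_SLICES):
    """Place rows round-robin across slices, ignoring their values."""
    slices = defaultdict(list)
    for index, row in enumerate(rows):
        slices[index % num_slices].append(row)
    return slices


def rows_needing_movement(fact_slices, dim_slices, key_column):
    """Count fact rows whose matching dimension row is on another slice."""
    dim_location = {
        row[key_column]: slice_id
        for slice_id, slice_rows in dim_slices.items()
        for row in slice_rows
    }
    return sum(
        1
        for slice_id, slice_rows in fact_slices.items()
        for row in slice_rows
        if dim_location[row[key_column]] != slice_id
    )


# Hypothetical star schema: a customer dimension and a sales fact table.
customers = [{"customer_id": c} for c in range(1, 9)]
sales = [{"sale_id": i, "customer_id": (i * 3) % 8 + 1} for i in range(40)]

key_moved = rows_needing_movement(
    distribute_by_key(sales, "customer_id"),
    distribute_by_key(customers, "customer_id"),
    "customer_id",
)
even_moved = rows_needing_movement(
    distribute_evenly(sales),
    distribute_evenly(customers),
    "customer_id",
)

print("rows to move with KEY distribution:", key_moved)
print("rows to move with EVEN distribution:", even_moved)
```

Because both tables are hashed on the same column, every sale lands on the slice that holds its customer, so the join needs no redistribution under key-based placement. Round-robin placement gives no such guarantee. In a real cluster the trade-off is skew: a key with few distinct values, or with a few very frequent values, can leave some slices holding far more rows than others.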