AWS Certified Big Data – Specialty — Question 54
An administrator receives about 100 files per hour into Amazon S3 and will be loading the files into Amazon Redshift. Customers who analyze the data within Redshift gain significant value when they receive data as quickly as possible. The customers have agreed to a maximum loading interval of 5 minutes.
Which loading approach should the administrator use to meet this objective?
Answer options
- A. Load each file as it arrives because getting data into the cluster as quickly as possible is the priority.
- B. Load the cluster as soon as the administrator has the same number of files as nodes in the cluster.
- C. Load the cluster when the administrator has an even multiple of files relative to Cluster Slice Count, or 5 minutes, whichever comes first.
- D. Load the cluster when the number of files is less than the Cluster Slice Count.
Correct answer: C
Explanation
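Option C is correct. The Amazon Redshift COPY command loads files in parallel across the cluster's slices, so a batch whose file count is a multiple of the slice count gives every slice an equal share of the work, and the 5-minute cap guarantees a load still runs when files arrive slowly. Option A issues a separate load for every file, roughly 100 loads per hour, and a single file generally cannot spread work evenly across slices. Option B counts nodes rather than slices; each node is partitioned into slices, so matching the node count can leave slices idle, and the option sets no time limit. Option D loads fewer files than there are slices, which leaves some slices idle by design, and it also sets no time limit, so neither B nor D guarantees delivery within the agreed interval.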
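The rule in option C can be sketched as a small decision function. This is only an illustration under stated assumptions: the function name `should_load`, its parameters, and the idea of polling it from a scheduler are hypothetical and do not come from the question or from any AWS API. It only decides when to trigger a load; the load itself (for example a COPY from S3) is out of scope.

```python
# Agreed maximum loading interval from the question: 5 minutes.
MAX_INTERVAL_SECONDS = 5 * 60


def should_load(pending_files: int,
                slice_count: int,
                seconds_since_last_load: float,
                max_interval: float = MAX_INTERVAL_SECONDS) -> bool:
    """Return True when a load should start under the option C rule.

    All names here are hypothetical, chosen for illustration only.
    """
    if slice_count <= 0:
        raise ValueError("slice_count must be positive")
    if pending_files <= 0:
        # Nothing to load, even if the interval has elapsed.
        return False
    if seconds_since_last_load >= max_interval:
        # The 5-minute cap wins over batch shape ("whichever comes first").
        return True
    # Otherwise wait until the batch is a whole multiple of the slice count,
    # so each slice can receive the same number of files.
    return pending_files % slice_count == 0


if __name__ == "__main__":
    # Example values are made up: a cluster with 4 slices.
    print(should_load(pending_files=8, slice_count=4, seconds_since_last_load=60))
    print(should_load(pending_files=3, slice_count=4, seconds_since_last_load=60))
    print(should_load(pending_files=3, slice_count=4, seconds_since_last_load=300))
```

At about 100 files per hour, roughly 8 files arrive in any 5-minute window, so on a cluster with more than a handful of slices the time cap would often be the condition that triggers the load.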