AWS Certified Data Analytics – Specialty — Question 23

A company that monitors weather conditions from remote construction sites is setting up a solution to collect temperature data from the following two weather stations.
✑ Station A, which has 10 sensors
✑ Station B, which has five sensors
These weather stations were placed by onsite subject-matter experts.
Each sensor has a unique ID. The data from each sensor will be collected using Amazon Kinesis Data Streams.
Based on the total incoming and outgoing data throughput, a single Amazon Kinesis data stream with two shards is created. Two partition keys are created based on the station names. During testing, there is a bottleneck on data coming from Station A, but not from Station B. Upon review, it is confirmed that the total stream throughput is still less than the allocated Kinesis Data Streams throughput.
How can this bottleneck be resolved without increasing the overall cost and complexity of the solution, while retaining the data collection quality requirements?

Answer options

Correct answer: C

Explanation

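The correct answer is C. With the station name as the partition key, every record from a station hashes to the same shard, so all 10 Station A sensors write to a single shard and can exceed that shard's write limit (1 MB/s or 1,000 records/s) even though the total stream throughput is below the allocated capacity. Changing the partition key to the sensor ID produces 15 distinct keys that can spread across both shards, which relieves the bottleneck without adding shards or cost. Options A and B would increase the complexity and cost of the solution, while D would reduce the data collection capability, which does not meet the quality requirements.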
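
The effect of partition key granularity can be illustrated with a small local simulation. This is a minimal sketch, not the Kinesis implementation: it assumes the MD5 digest of the partition key is read as a big-endian 128-bit integer and that the two shards split the hash key space into equal contiguous ranges. The key values ("station-a", "a-01", and so on) and the function names are hypothetical.

```python
import hashlib
from collections import Counter

HASH_SPACE = 2 ** 128  # Kinesis hashes partition keys with MD5 into a 128-bit space


def shard_for_key(partition_key: str, shard_count: int = 2) -> int:
    """Return the index of the shard whose hash key range contains this key.

    Assumption: shards cover equal, contiguous slices of the hash space.
    """
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    hash_key = int.from_bytes(digest, "big")
    return hash_key * shard_count // HASH_SPACE


def shard_load(partition_keys, shard_count: int = 2) -> Counter:
    """Count records per shard, given one partition key per record."""
    return Counter(shard_for_key(key, shard_count) for key in partition_keys)


if __name__ == "__main__":
    # One reading per sensor: 10 sensors at Station A, 5 at Station B.
    by_station = ["station-a"] * 10 + ["station-b"] * 5
    by_sensor = [f"a-{i:02d}" for i in range(1, 11)] + [f"b-{i:02d}" for i in range(1, 6)]

    print("keyed by station:", dict(shard_load(by_station)))
    print("keyed by sensor: ", dict(shard_load(by_sensor)))
```

When records are keyed by station, all 10 Station A records always land on one shard, and the two station keys may even hash to the same shard. When records are keyed by sensor, the 15 keys are hashed independently, so the load tends to spread across both shards, although an even split is not guaranteed with so few keys.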