AWS Certified Data Engineer – Associate (DEA-C01) — Question 89
A manufacturing company has many IoT devices in facilities around the world. The company uses Amazon Kinesis Data Streams to collect data from the devices. The data includes device ID, capture date, measurement type, measurement value, and facility ID. The company uses facility ID as the partition key.
The company's operations team recently observed many WriteThroughputExceeded exceptions. The operations team found that some shards were heavily used but other shards were generally idle.
How should the company resolve the issues that the operations team observed?
Answer options
- A. Change the partition key from facility ID to a randomly generated key.
- B. Increase the number of shards.
- C. Archive the data on the producer's side.
- D. Change the partition key from facility ID to capture date.
Correct answer: A
Explanation
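The correct answer is A. Kinesis Data Streams assigns each record to a shard by hashing its partition key, so every record with the same facility ID lands on the same shard. Facilities that produce far more data than others overload their shards, while shards that serve small facilities, or no facility at all, sit idle. Each shard accepts writes of at most 1 MB or 1,000 records per second, and the hot shards exceed that limit, which causes the WriteThroughputExceeded exceptions (the service reports them as ProvisionedThroughputExceededException). A randomly generated partition key spreads records roughly uniformly across all shards, which removes the hot spots. The trade-off is that records from the same facility are no longer ordered within a single shard.
Increasing the number of shards (option B) adds capacity but does not fix the skew, because all records for a busy facility still hash to one shard. Archiving data on the producer's side (option C) does not change how records are distributed across shards. Changing the partition key to capture date (option D) would likely make the skew worse: if the capture date has day-level granularity, all records captured on the same day share one key and therefore one shard.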
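The effect of the partition key can be illustrated with a small local simulation. This is a sketch, not the service's implementation: it relies on the documented behavior that Kinesis maps a partition key to a 128-bit integer with MD5, and it assumes the shards split that hash space into equal ranges. The facility names, traffic weights, shard counts, and the helpers `shard_for` and `records_per_shard` are hypothetical and chosen only for illustration.

```python
import hashlib
import random
from collections import Counter

HASH_SPACE = 2 ** 128  # MD5 yields a 128-bit hash value


def shard_for(partition_key: str, num_shards: int) -> int:
    """Return the shard index for a key, assuming equal-sized hash key ranges."""
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    hash_value = int.from_bytes(digest, "big")
    return hash_value * num_shards // HASH_SPACE


def records_per_shard(keys, num_shards):
    """Count how many records each shard would receive."""
    counts = Counter(shard_for(key, num_shards) for key in keys)
    return [counts[i] for i in range(num_shards)]


rng = random.Random(42)  # fixed seed so runs are repeatable
NUM_SHARDS = 4
NUM_RECORDS = 10_000

# Hypothetical workload: one large facility dominates the traffic.
facility_keys = rng.choices(
    ["FAC-001", "FAC-002", "FAC-003"], weights=[70, 20, 10], k=NUM_RECORDS
)
# Option A: a fresh random key for every record.
random_keys = [f"{rng.getrandbits(128):032x}" for _ in range(NUM_RECORDS)]

facility_counts = records_per_shard(facility_keys, NUM_SHARDS)
random_counts = records_per_shard(random_keys, NUM_SHARDS)
# Option B: same facility keys, four times as many shards.
more_shards_counts = records_per_shard(facility_keys, NUM_SHARDS * 4)

print("facility ID key, 4 shards: ", facility_counts)
print("random key, 4 shards:      ", random_counts)
print("facility ID key, 16 shards:", more_shards_counts)
```

With only three distinct facility keys, at most three shards can ever receive traffic, no matter how many shards the stream has, and the shard that holds the busiest facility carries most of the load. The random keys should land close to an even split across the four shards.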