AWS Certified Data Engineer – Associate (DEA-C01) — Question 131
A company is creating near real-time dashboards to visualize time series data. The company ingests data into Amazon Managed Streaming for Apache Kafka (Amazon MSK). A customized data pipeline consumes the data. The pipeline then writes data to Amazon Keyspaces (for Apache Cassandra), Amazon OpenSearch Service, and Apache Avro objects in Amazon S3.
Which solution will make the data available for the data visualizations with the LEAST latency?
Answer options
- A. Create OpenSearch Dashboards by using the data from OpenSearch Service.
- B. Use Amazon Athena with an Apache Hive metastore to query the Avro objects in Amazon S3. Use Amazon Managed Grafana to connect to Athena and to create the dashboards.
- C. Use Amazon Athena to query the data from the Avro objects in Amazon S3. Configure Amazon Keyspaces as the data catalog. Connect Amazon QuickSight to Athena to create the dashboards.
- D. Use AWS Glue to catalog the data. Use S3 Select to query the Avro objects in Amazon S3. Connect Amazon QuickSight to the S3 bucket to create the dashboards.
Correct answer: A
Explanation
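Option A is correct because the pipeline already writes the data to OpenSearch Service, and OpenSearch Dashboards queries those indexes directly. Documents become searchable shortly after they are indexed, so the dashboards reflect new data without a separate query engine, catalog, or extra data movement. The other options all read the Avro objects in Amazon S3, which adds latency, and two of them rely on integrations that are not supported. Option B can work, but every dashboard refresh runs an Athena query that scans objects in S3, which is slower than querying an index. Option C is not viable because Amazon Keyspaces cannot serve as a data catalog for Athena. Option D is not viable because S3 Select supports CSV, JSON, and Parquet objects, not Avro, and QuickSight does not read Avro objects directly from an S3 bucket.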
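To illustrate why option A has so few moving parts, the sketch below builds the kind of request body a time series panel might send to OpenSearch: a range filter on recent data plus a `date_histogram` aggregation. The function name `build_timeseries_query` and the field names `@timestamp` and `value` are assumptions made for this example and do not come from the question. The code only constructs the query body and does not connect to a cluster.

```python
import json


def build_timeseries_query(timestamp_field="@timestamp",
                           value_field="value",
                           window="now-15m",
                           interval="10s"):
    """Build an OpenSearch query body for a near real-time time series panel.

    All field names and defaults are hypothetical placeholders.
    """
    return {
        # No raw documents are needed, only the aggregated buckets.
        "size": 0,
        # Restrict the search to the most recent window of data.
        "query": {"range": {timestamp_field: {"gte": window}}},
        "aggs": {
            "per_interval": {
                # Group documents into fixed-width time buckets.
                "date_histogram": {
                    "field": timestamp_field,
                    "fixed_interval": interval,
                },
                # Average the metric inside each bucket.
                "aggs": {"avg_value": {"avg": {"field": value_field}}},
            }
        },
    }


if __name__ == "__main__":
    print(json.dumps(build_timeseries_query(), indent=2))
```

A body like this would be sent to the `_search` endpoint of the index that the pipeline writes to. Because the aggregation runs against data that is already indexed, no catalog lookup or S3 scan sits between the dashboard and the data.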