EMC Proven Professional – Data Science and Big Data Analytics — Question 18
The web analytics team uses Hadoop to process access logs. They now want to correlate this data with structured user data residing in their massively parallel database. Which tool should they use to export the structured data from Hadoop?
Answer options
- A. Sqoop
- B. Pig
- C. Chukwa
- D. Scribe
Correct answer: A
Explanation
Sqoop is specifically designed for transferring data between Hadoop and structured data stores, making it the ideal choice for exporting structured data. Pig is a high-level platform for creating programs that run on Hadoop but does not specialize in data export. Chukwa is used for data collection and monitoring, while Scribe is for logging data, neither of which serve the purpose of exporting structured data from Hadoop.