Keywords: Sqoop; big data pipeline; NoSQL; RDBMS
Abstract: Sqoop is an integral part of the Hadoop ecosystem that helps transfer data between NoSQL data stores and traditional RDBMSs. Numerous technical articles cover the Sqoop command-line interface (CLI). However, as of Sqoop 1.4.3, little information is publicly available on using the Sqoop Java API. This article covers the Sqoop CLI, with additional emphasis on the Sqoop Java API, using example data from the Bombay Stock Exchange. It is intended as a preliminary introduction for technical architects, solution architects, technical managers, consultants, data scientists, technical leads, and developers interested in or working in the big data space.