Sqoop is a JDBC-based technology, which is used for bi-directional data transfers from Hadoop to any RDBMS solution. This opens up the scope to merge structured and unstructured data and provide powerful analytics on the data overall. The SQL Server-Hadoop Connector is a Sqoop implementation, which is specifically designed for data transfer between SQL Server and Hadoop. This chapter explained how to configure and install Sqoop on your Hadoop NameNode and execute sample import
/export
commands to move data to and from SQL Server and Hadoop. In the next chapter, you will learn to consume Hadoop data through another Apache supporting project called Hive. You would also learn how to use the client-side Hive ODBC driver to consume Hive data from Business Intelligence tools for example, SQL Server Integration Services.
You're reading from Microsoft SQL Server 2012 with Hadoop
The rest of the page is locked
You have been reading a chapter from
Microsoft SQL Server 2012 with HadoopPublished in: Aug 2013Publisher: PacktISBN-13: 9781782177982
© 2013 Packt Publishing Limited All Rights Reserved