You're reading from Microsoft SQL Server 2012 with Hadoop

Product type: Book
Published in: Aug 2013
Reading Level: Intermediate
Publisher: Packt
ISBN-13: 9781782177982
Edition: 1st Edition
Author: Debarchan Sarkar

Debarchan Sarkar is a Microsoft Data Platform engineer. He specializes in the Microsoft SQL Server Business Intelligence stack. Debarchan is a subject matter expert in SQL Server Integration Services and delves deep into the open source world, specifically the Apache Hadoop framework. He is currently working on a technology called HDInsight, which is Microsoft's distribution of Hadoop on Windows. He has authored various books on SQL Server and Big Data, including Microsoft SQL Server 2012 with Hadoop, Packt Publishing, and Pro Microsoft HDInsight: Hadoop on Windows, Apress. His Twitter handle is @debarchans.

Summary


Sqoop is a JDBC-based technology for bidirectional data transfer between Hadoop and any RDBMS. This makes it possible to combine structured and unstructured data and run powerful analytics across the combined data. The SQL Server-Hadoop Connector is a Sqoop implementation designed specifically for data transfer between SQL Server and Hadoop. This chapter explained how to install and configure Sqoop on your Hadoop NameNode and how to execute sample import/export commands to move data between SQL Server and Hadoop. In the next chapter, you will learn to consume Hadoop data through another Apache supporting project called Hive. You will also learn how to use the client-side Hive ODBC driver to consume Hive data from Business Intelligence tools such as SQL Server Integration Services.
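
As a reminder of the shape of the import/export commands mentioned above, here is a minimal sketch in Python that only assembles the command lines and does not run them. The host, database, table, user, and HDFS paths are hypothetical placeholders, not values from the chapter, and the helper function names are made up for illustration. The flags are the generic Sqoop ones (`--connect`, `--username`, `-P`, `--table`, `--target-dir`, `--export-dir`, `--num-mappers`); any connector-specific options would need to be checked against your installation.

```python
import shlex
from typing import List


def sqlserver_jdbc_url(host: str, database: str, port: int = 1433) -> str:
    """Build a JDBC URL in the format used by the SQL Server JDBC driver."""
    return f"jdbc:sqlserver://{host}:{port};databaseName={database}"


def sqoop_import_args(jdbc_url: str, username: str, table: str,
                      target_dir: str, num_mappers: int = 1) -> List[str]:
    """Argument list for copying a SQL Server table into an HDFS directory."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "-P",                      # prompt for the password at run time
        "--table", table,
        "--target-dir", target_dir,
        "--num-mappers", str(num_mappers),
    ]


def sqoop_export_args(jdbc_url: str, username: str, table: str,
                      export_dir: str, num_mappers: int = 1) -> List[str]:
    """Argument list for copying files in an HDFS directory into a SQL Server table."""
    return [
        "sqoop", "export",
        "--connect", jdbc_url,
        "--username", username,
        "-P",
        "--table", table,
        "--export-dir", export_dir,
        "--num-mappers", str(num_mappers),
    ]


if __name__ == "__main__":
    # Hypothetical placeholder values, for illustration only.
    url = sqlserver_jdbc_url("sqlhost.example.com", "SalesDb")
    print(shlex.join(sqoop_import_args(url, "sqoop_user", "Orders", "/data/orders")))
    print(shlex.join(sqoop_export_args(url, "sqoop_user", "OrdersSummary", "/data/orders_summary")))
```

Building the command as a list of arguments rather than a single string keeps the semicolon in the JDBC URL from being interpreted by a shell; `shlex.join` is used here only to print a copy-and-paste-safe form.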
