You're reading from Microsoft SQL Server 2012 with Hadoop

Product type: Book
Published in: Aug 2013
Reading level: Intermediate
Publisher: Packt
ISBN-13: 9781782177982
Edition: 1st Edition
Author: Debarchan Sarkar

Debarchan Sarkar is a Microsoft Data Platform engineer. He specializes in the Microsoft SQL Server Business Intelligence stack. Debarchan is a subject matter expert in SQL Server Integration Services and delves deep into the open source world, specifically the Apache Hadoop framework. He is currently working on a technology called HDInsight, which is Microsoft's distribution of Hadoop on Windows. He has authored various books on SQL Server and Big Data, including Microsoft SQL Server 2012 with Hadoop, Packt Publishing, and Pro Microsoft HDInsight: Hadoop on Windows, Apress. His Twitter handle is @debarchans.

The Sqoop import tool


You're now ready to use the SQL Server-Hadoop Connector to import data from SQL Server 2012 into HDFS. The input to the import process is a SQL Server table, which Sqoop reads row by row into HDFS. The output is a set of files containing a copy of the imported table. Because the import runs in parallel, the output is spread across multiple files.

When using the sqoop import command, you must specify the following arguments:

  • The --connect argument, which specifies the connection string to the SQL Server database

  • The --username and --password arguments, which provide valid credentials for connecting to the SQL Server database

  • Either the --table or the --query argument, to import an entire table or the results of a custom query
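The way these arguments fit together can be sketched in a few lines of Python. This is only an illustration, not the book's own command: the helper build_sqoop_import, the host name sqlhost, and the credentials are hypothetical, and the --target-dir option (a standard Sqoop import option for choosing the HDFS output directory) is an addition not listed above. The table, database, and directory names are the ones used in this section's example. The sketch only assembles the command line; it does not run Sqoop.

```python
import shlex


def build_sqoop_import(connect, username, password,
                       table=None, query=None, target_dir=None):
    """Assemble the argument list for a `sqoop import` call (hypothetical helper)."""
    # The source is either a whole table or a custom query, never both.
    if (table is None) == (query is None):
        raise ValueError("specify exactly one of table or query")

    args = [
        "sqoop", "import",
        "--connect", connect,      # JDBC connection string to SQL Server
        "--username", username,    # SQL Server login
        "--password", password,    # password for that login
    ]
    if table is not None:
        args += ["--table", table]
    else:
        args += ["--query", query]

    # Assumption: --target-dir selects the HDFS output directory.
    if target_dir is not None:
        args += ["--target-dir", target_dir]
    return args


cmd = build_sqoop_import(
    # Hypothetical host and credentials; replace with your own.
    connect="jdbc:sqlserver://sqlhost:1433;databaseName=AdventureWorks2012",
    username="sqoop_user",
    password="example-password",
    table="ErrorLog",
    target_dir="/data/ErrorLogs",
)

# Print a shell-quoted command line that could be pasted into a terminal.
print(shlex.join(cmd))
```

Building the command as a list rather than a single string keeps each argument separate, so a connection string that contains a semicolon is quoted correctly when the command is printed.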

The following command imports data from the ErrorLog table in the SQL Server AdventureWorks2012 database into delimited text files in the /data/ErrorLogs directory on HDFS.

Note

Sqoop 1.4.2 does not recognize SQL Server tables that do not belong to...
