You're reading from Cloud Scale Analytics with Azure Data Services

Product type: Book
Published in: Jul 2021
Publisher: Packt
ISBN-13: 9781800562936
Edition: 1st Edition
Author: Patrik Borosch

Patrik Borosch is a cloud solution architect for data and AI at Microsoft Switzerland GmbH. He has more than 25 years of BI and analytics development, engineering, and architecture experience and is a Microsoft Certified Data Engineer and a Microsoft Certified AI Engineer. Patrik has worked on numerous significant international data warehouse, data integration, and big data projects. Through this, he has built and extended his experience in all facets, from requirements engineering to data modeling and ETL, all the way to reporting and dashboarding. At Microsoft Switzerland, he supports customers in their journey into the analytical world of the Azure Cloud.

Integrating data with Synapse Spark pools

If you are a Spark developer and want to use Synapse Spark to wrangle your data and load it into your dedicated SQL pools, this is quite easy to accomplish.

JDBC was, and still is, the way to establish the connection and the exchange. There is one caveat when you use plain JDBC to interact with dedicated SQL pools: it will only talk to the control node of your dedicated pool. This is suboptimal, as both Spark and dedicated SQL pools have a lot of parallelism to offer.
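To make the plain JDBC path concrete, here is a minimal PySpark-style sketch that uses Spark's generic `jdbc` data source. The server, database, table, and credential values are placeholders, and the helper names `build_jdbc_options` and `read_table_via_jdbc` are hypothetical, introduced only for illustration. The `spark` argument is assumed to be an existing SparkSession, such as the one a Synapse notebook provides.

```python
def build_jdbc_options(server, database, table, user, password):
    """Assemble options for Spark's generic JDBC data source.

    All argument values are placeholders supplied by the caller.
    """
    url = (
        f"jdbc:sqlserver://{server}:1433;"
        f"database={database};encrypt=true;loginTimeout=30"
    )
    return {
        "url": url,
        "dbtable": table,
        "user": user,
        "password": password,
        # Class name of the Microsoft JDBC driver for SQL Server.
        "driver": "com.microsoft.sqlserver.jdbc.SQLServerDriver",
    }


def read_table_via_jdbc(spark, options):
    """Read a table over a plain JDBC connection.

    The connection goes to the SQL endpoint only, so the parallel
    workers of the dedicated SQL pool are not used for the transfer.
    """
    return spark.read.format("jdbc").options(**options).load()
```

In a notebook this could be called as `read_table_via_jdbc(spark, build_jdbc_options(...))`. In practice, credentials would come from a secret store rather than from literals in the code.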

Microsoft adjusted the JDBC driver slightly to benefit from the parallel workers available on both sides. The JDBC driver establishes a connection between the control node of the dedicated SQL pool and the driver node of the Spark cluster. The Spark engine issues CETAS (CREATE EXTERNAL TABLE AS SELECT) statements and sends filters and projections over this channel. The data itself, by contrast, is exchanged using PolyBase and the Data Lake storage that is attached to...
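To give a feel for what a CETAS statement carrying a projection and a filter looks like, the following sketch assembles one as a string. It is an illustration only: the text does not show the statements the connector actually generates, and the helper `build_cetas` as well as every table, data source, and file format name passed to it are hypothetical.

```python
def build_cetas(external_table, location, data_source, file_format,
                source_table, columns=None, predicate=None):
    """Build an illustrative CREATE EXTERNAL TABLE AS SELECT statement.

    `columns` is the projection and `predicate` is the filter.
    Values are interpolated as-is, so this is not safe for
    untrusted input; it only shows the shape of the statement.
    """
    projection = ", ".join(columns) if columns else "*"
    where = f"\nWHERE {predicate}" if predicate else ""
    return (
        f"CREATE EXTERNAL TABLE {external_table}\n"
        f"WITH (\n"
        f"    LOCATION = '{location}',\n"
        f"    DATA_SOURCE = {data_source},\n"
        f"    FILE_FORMAT = {file_format}\n"
        f")\n"
        f"AS\n"
        f"SELECT {projection}\n"
        f"FROM {source_table}{where};"
    )


# Hypothetical names throughout; none of them come from the book.
statement = build_cetas(
    external_table="ext.SalesExport",
    location="/export/sales/",
    data_source="MyDataLake",
    file_format="ParquetFormat",
    source_table="dbo.Sales",
    columns=["CustomerId", "Amount"],
    predicate="Amount > 100",
)
print(statement)
```

The point of the sketch is the division of labor: only a small statement like this travels over the JDBC channel, while the selected rows are written to and read from the storage location by the parallel workers.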
