About this video
Data is an incredible asset, especially when there are lots of it. Exploratory data analysis, business intelligence, and machine learning all depend on processing and analyzing Big Data at scale.
How do you go from working on prototypes on your local machine, to handling messy data in production and at scale?
This is a practical, hands-on course that shows you how to use Spark and it's Python API to create performant analytics with large-scale data. Don't reinvent the wheel, and wow your clients by building robust and responsible applications on Big Data.
All the code and supporting files for this course are available on Github at - https://github.com/PacktPublishing/Hands-On-Pyspark-for-Big-Data-Analysis
Style and Approach
This hands-on course is divided into clear bite-size chunks so you can learn at your own pace and focus on the areas of most interest to you. It’s practical and packed with step-by-step instructions, working examples, and helpful advice from our expert author. You will learn how PySpark provides an easy to use, performant way to do data analysis with Big Data.
- Publication date:
- December 2018
- 1 hours 52 minutes