Data Acquisition and Manipulation with Python [Video]

More Information
Learn
  • See how to acquire and save different formats of data
  • Find out how to connect to a database and add information to it
  • Combine and merge data sets, and manipulate strings using Python
  • Aggregate your data and employ group-wise operations and transformations
  • Know how to download and read a web page using the BeautifulSoup package
  • Extract useful information from websites using Selenium in Python
  • Program and employ a Scrapy spider for successful web scraping
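As a rough illustration of the first few items above (reading a data file, adding it to a database, and aggregating by group), here is a minimal sketch. It is not taken from the course: it uses only Python's standard library (csv and sqlite3) rather than whatever packages the course itself relies on, and the sample data, the table name sales, and the helper functions load_rows and units_by_region are all hypothetical.

```python
import csv
import io
import sqlite3

# Hypothetical sample data standing in for a downloaded .csv file.
CSV_TEXT = """region,product,units
north,apples,10
north,pears,5
south,apples,7
south,pears,3
north,apples,4
"""


def load_rows(text):
    """Parse CSV text into a list of (region, product, units) tuples."""
    reader = csv.DictReader(io.StringIO(text))
    return [(r["region"], r["product"], int(r["units"])) for r in reader]


def units_by_region(rows):
    """Store rows in an in-memory SQLite database and total units per region."""
    conn = sqlite3.connect(":memory:")
    try:
        # Hypothetical table layout chosen for this sketch.
        conn.execute("CREATE TABLE sales (region TEXT, product TEXT, units INTEGER)")
        conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
        conn.commit()
        # Group-wise aggregation: one summed row per region.
        cursor = conn.execute(
            "SELECT region, SUM(units) FROM sales GROUP BY region ORDER BY region"
        )
        return dict(cursor.fetchall())
    finally:
        conn.close()


rows = load_rows(CSV_TEXT)
totals = units_by_region(rows)
print(totals)  # {'north': 19, 'south': 10}
```

The same grouping could be done in memory with a data-frame library; a database is used here only because the list mentions connecting to one.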
About

Python, a multi-paradigm programming language, has become the language of choice for data scientists for data analysis, visualization, and machine learning.

In this course, you’ll start by learning how to acquire data from the web that is already in a “clean” format, such as a .csv file or a database. You’ll then learn to transform this data into the format most useful for analysis. After that, you’ll dive into data aggregation and grouping, where you’ll learn to group similar data for easier analysis.
From there, you’ll be shown different methods of web scraping using Python. Finally, you’ll learn to extract large amounts of data using BeautifulSoup, as well as work with Selenium and Scrapy.
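To give a feel for what extracting information from a web page involves, here is a small sketch that collects link targets from an HTML string. It is an assumption-laden stand-in, not the course's method: it uses the standard library's html.parser module instead of BeautifulSoup, Selenium, or Scrapy, it parses a hard-coded page rather than downloading one, and the names PAGE, LinkCollector, and extract_links are made up for this example.

```python
from html.parser import HTMLParser

# Hypothetical page content; a real scraper would download this first.
PAGE = """
<html><body>
  <h1>Catalog</h1>
  <a href="/courses/python">Python</a>
  <a href="/courses/r">R</a>
  <a>No link target</a>
</body></html>
"""


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag that has one."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the opening tag.
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


def extract_links(html):
    """Return all link targets found in the given HTML text, in page order."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links


links = extract_links(PAGE)
print(links)  # ['/courses/python', '/courses/r']
```

Dedicated scraping packages offer far more convenient searching and navigation than this; the sketch only shows the basic idea of walking a page's tags and keeping the parts of interest.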

Style and Approach

In-depth content balanced with tutorials that put theory into practice. This course will give you both the understanding and the practical experience that will allow you to indulge in the art of data acquisition and manipulation using Python packages.

Features
  • Enhance your knowledge as a data analyst by diving into the necessary tools for data acquisition and manipulation 
  • Real-world, practical examples that help you wrap your head around the essential know-how for data management 
  • Covers important concepts such as data wrangling and data aggregation
Course Length: 2 hours 39 minutes
ISBN: 9781788291415
Date Of Publication: 27 Sep 2017

Authors

Curtis Miller

Curtis Miller is a doctoral candidate at the University of Utah studying mathematical statistics. He writes software for both research and personal interest, including the R package CPAT, available on the Comprehensive R Archive Network (CRAN). His publications include academic papers, along with books and video courses published by Packt Publishing. His video courses include Unpacking NumPy and Pandas, Data Acquisition and Manipulation with Python, Training Your Systems with Python Statistical Modelling, and Applications of Statistical Learning with Python. His books include Hands-On Data Analysis with NumPy and Pandas.