Get hands-on solutions that will take your web scraping skills in Python to the next level
This is your one-stop solution for common and not-so-common issues while performing web scraping with Python
Understand a web page’s structure and collect meaningful data from a website with ease
Description
Python is a high-level programming language used for general-purpose programming. It has a design philosophy which emphasizes code readability and a syntax which allows programmers to express concepts in fewer lines of code than possible in languages such as C++ or Java.
This video course is a rich collection of recipes that will come in handy when you are scraping a website using Python, addressing your usual and unusual problems while scraping websites by diving deep into the capabilities of Python’sweb scraping tools such as Selenium, BeautifulSoup, and urllib2. The video will start with showing how to use selenium module for scraping by setting up a web driver, debugging with the Console and downloading files and streamlining with a Headless Browser (PhantomJS). The video will then move on to demonstrate how to do parsing with Beautifulsoup which would include introduction to the BeautifulSoupObjects, Nested Selectors and Regular Expressions Basics and how to do UTF-8 Encoding. The video will finally end by showing how to do fetching with urlib2 by using the developer tools Network tab, how to bypass the browser and retrieve files.
By The end of this video, you will be successfully able to understand the in-depth capabilities of python web scraping tools.
What you will learn
[*]Use the Selenium module and scrape with Selenium
[*]Find out how to set up a web driver
[*]Perform debugging with the console and download files
[*]Learn to work with Nested selectors and regular expression basics
[*] Discover how to perform parsing with BeautifulSoup
[*]Understand authentication with Wireshark.
[*]Master the use of URL Query Strings and HTTP Requests (GET and POST)
[*] Implement streamlining with headless browser
What do you get with a video?
Download this video in MP4 format
Access this title in our online reader with advanced features
DRM FREE - Read whenever, wherever and however you want
Charles Clayton is a sole proprietor of crclayton technologies co and an independent web developer. He is an experienced developer and Python specialist in Python web scraping solutions and tools such asSelenium, BeautifulSoup,and urllib2. He has 2 years of experience as a Reliability Engineer with West frazweer.
How can I download a video package for offline viewing?
Login to your account at Packtpub.com.
Click on "My Account" and then click on the "My Videos" tab to access your videos.
Click on the "Download Now" link to start your video download.
How can I extract my video file?
All modern operating systems ship with ZIP file extraction built in. If you'd prefer to use a dedicated compression application, we've tested WinRAR / 7-Zip for Windows, Zipeg / iZip / UnRarX for Mac and 7-Zip / PeaZip for Linux. These applications support all extension files.
How can I get help and support around my video package?
If your video course doesn't give you what you were expecting, either because of functionality problems or because the content isn't up to scratch, please mail customercare@packt.com with details of the problem. In addition, so that we can best provide the support you need, please include the following information for our support team.
Video
Format watched (HTML, MP4, streaming)
Chapter or section that issue relates to (if relevant)
System being played on
Browser used (if relevant)
Details of support
Why can’t I download my video package?
In the even that you are having issues downloading your video package then please follow these instructions:
Disable all your browser plugins and extensions: Some security and download manager extensions can cause issues during the download.
Download the video course using a different browser: We've tested downloads operate correctly in current versions of Chrome, Firefox, Internet Explorer, and Safari.