Get your Java environment set up and running in an Instant using Packt’s new eBook

October 2013 | Web Development

Packt is pleased to announce the release of its new book Instant Web Scraping with Java by Ryan Mitchell. This helpful guide will take you step by step through setting up your Java environment. The book comes in at over 72 pages is available in all the popular eBook formats, competitively priced at $12.74.

About the Author:

Ryan Mitchell has ten years of programming experience, including Java, C, Perl, PHP, and Python. In addition to “traditional” programming, she specializes in web technologies, with 3 years of Drupal development experience, and is Sitecore developer certified. She graduated from the Olin College of Engineering and is currently studying for a Master's degree in Software Engineering at the Harvard University Extension School. She has also worked as a developer for Harvard University and Abine Inc.

Web scraping is the process of automatically collecting information from the World Wide Web. It is a field of active developments sharing a common goal with the semantic web vision, an ambitious initiative that still requires breakthroughs in text processing, semantic understanding, artificial intelligence and human-computer interactions. Web scraping favors practical solutions based on existing technologies that are often entirely ad hoc.

Instant Web Scraping with Java will teach readers to write simple web scrapers and distributed networks of crawlers. Readers will learn how to build their own web scrapers using real-world scraping examples that collect and store data from Wikipedia, public records data sites, and IP address geolocation services. Short, concise recipes of practical instructions will show readers how to run scrapers across multiple servers, run them in parallel, and subvert common methods of anti-scraper security used on modern websites. Instant Web Scraping with Java will show readers how to view and collect any Internet data at the speed of their processor!

Packt is one of the most prolific and fast-growing tech book publishers in the world. Originally focused on open source software, Packt books focuses on practicality, recognising that readers are ultimately concerned with getting the job done. Packt’s digitally-focused business model allows to publish up-to-date books in very specific areas.


Instant Web Scraping with Java
Get your Java environment set up and running

For more information, please visit: http://www.packtpub.com/web-scraping-with-java/book

Code Download and Errata
Packt Anytime, Anywhere
Register Books
Print Upgrades
eBook Downloads
Video Support
Contact Us
Awards Voting Nominations Previous Winners
Judges Open Source CMS Hall Of Fame CMS Most Promising Open Source Project Open Source E-Commerce Applications Open Source JavaScript Library Open Source Graphics Software
Resources
Open Source CMS Hall Of Fame CMS Most Promising Open Source Project Open Source E-Commerce Applications Open Source JavaScript Library Open Source Graphics Software