Python 2.6 Text Processing: Beginner's Guide

With a basic knowledge of Python you have the potential to undertake time-saving text processing. This book is a great introduction to the various techniques, and teaches through practical examples and clear explanations.

Jeff McNeil

Book Details

ISBN 13: 9781849512121
Paperback: 380 pages

Book Description

For programmers, working with text is not about reading their newspaper on a break; it's about taking textual data in one form and doing something to it. Extract, decrypt, parse, restructure – these are just some of the text tasks that can occupy much of a programmer's life. If this is your life, this book will make it better – a practical guide on how to do what you want with textual data in Python.

Python 2.6 Text Processing Beginner's Guide is the easiest way to learn how to manipulate text with Python. Packed with examples, it will teach you text processing techniques and give you the skills to work with the most popular Python libraries for transforming text from one form to another.

The book gets you going with a quick look at some data formats and with installing the supporting libraries and components, so that you're ready to get started. You move on to extracting text from a collection of sources and handling it with Python's built-in string functions and regular expressions. You then look into processing structured text documents such as XML, HTML, JSON, and CSV, and progress to generating documents and creating templates. Finally, you look at ways to enhance text output via a collection of third-party packages such as Nucular, PyParsing, NLTK, and Mako.
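
To give a flavor of "taking textual data in one form and doing something to it", here is a minimal sketch that turns CSV text into JSON using only the standard library. It is not taken from the book: the function name `csv_to_json` and the sample data are made up for illustration, and the code assumes a Python 3 interpreter rather than the Python 2.6 the book targets.

```python
import csv
import io
import json


def csv_to_json(csv_text):
    """Convert CSV text with a header row into a JSON array of objects.

    Hypothetical helper for illustration only; not from the book.
    """
    # DictReader uses the first row as field names for every later row.
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = [dict(row) for row in reader]
    # sort_keys makes the output order stable regardless of column order.
    return json.dumps(rows, sort_keys=True)


# Made-up sample input: a header row followed by two records.
sample = "name,lang\nAda,Python\nBob,Tcl\n"
result = csv_to_json(sample)
print(result)
```

Both `csv.DictReader` and `json.dumps` ship with Python, so nothing needs to be installed for this kind of restructuring.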

What You Will Learn

  • Know the options available for processing text in Python
  • Parse JSON data that is often used as a data delivery mechanism on the Internet
  • Organize a log-processing application via modules and packages to make it more extensible
  • Perform conditional matches via look-ahead and look-behind assertions by using basic regular expressions
  • Process XML and HTML documents in a variety of ways based on the needs of your application
  • Implement callback methods to perform SAX processing and walk in-memory DOM structures
  • Understand Unicode, character encoding, internationalization, and localization
  • Lay out a Mako template-based project by using techniques such as template inheritance, additional tags, and custom filters
  • Install and use the Mako templating system to create your own Mako templates
  • Process a large number of e-mail messages using the Python standard library and index them with Nucular for fast searching
  • Fix common exceptions that occur while dealing with different types of text encoding
  • Build simple PDF output using the ReportLab toolkit's high-level PLATYPUS framework
  • Generate Microsoft Excel output using the xlwt module
  • Open and edit existing Open Document files to use them as template sources
  • Understand supporting functions and classes, such as the Python IO system and packaging components
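
One of the items above, conditional matching with look-ahead and look-behind assertions, can be sketched with the standard `re` module. This is an illustrative example rather than the book's own code: the pattern names `PRICE` and `NOT_PY2` and the sample strings are assumptions, and the snippet is written for Python 3.

```python
import re

# Positive look-behind: match a number only when a literal "$" comes
# immediately before it. The "$" itself is not part of the match.
PRICE = re.compile(r"(?<=\$)\d+(?:\.\d{2})?")

# Negative look-ahead: match "Python" only when it is NOT followed by " 2".
NOT_PY2 = re.compile(r"Python(?! 2)")

# Made-up sample text: "380" has no "$" in front, so it is skipped.
text = "eBook $18.90, print $44.99, 380 pages"
prices = PRICE.findall(text)
print(prices)

# Only the second "Python" passes the negative look-ahead.
versions = "Python 2.6, Python 3.12"
hits = [m.start() for m in NOT_PY2.finditer(versions)]
print(hits)
```

Look-around assertions check the surrounding text without consuming it, which is why the matched prices come back without the dollar sign.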
