Technical requirements
In this chapter, we will use the pandas, matplotlib, and scikit-learn Python libraries. We will also use NLTK, a comprehensive Python library for NLP and text analysis. You can find the instructions to install NLTK at http://www.nltk.org/install.html.
If you are using the Python Anaconda distribution, follow the instructions to install NLTK at https://anaconda.org/anaconda/nltk.
After you have installed NLTK, open up a Python console and execute the following:
import nltk
nltk.download('punkt')
nltk.download('stopwords')			These commands will download the necessary data for you to be able to run the recipes in this chapter successfully.
Note
If you haven’t downloaded these or the other data sources necessary for NLTK functionality, NLTK will raise an error. Read the error message carefully because it will direct you to download the data required to run the command that you are trying to execute.
 
                                             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
     
         
                 
                 
                 
                 
                 
                 
                 
                 
                