Technical requirements
In this chapter, we will use pandas, matplotlib, and featuretools open source Python libraries. You can install featuretools with pip:
pip install featuretools
Additionally, you can do so with conda:
conda install -c conda-forge featuretools
These commands install the basic featuretools functionality, but we can install add-ons for creating features with natural language processing (NLP) or for using dask as the backend instead of pandas. For more information on how to install featuretools add-ons, including graphviz, check out their documentation here: https://docs.featuretools.com/en/v0.16.0/getting_started/install.html.
We will work with the Online Retail II dataset from the UCI Machine Learning Repository, which is available at https://archive.ics.uci.edu/ml/datasets/Online+Retail+II and licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license: https://creativecommons.org/licenses/by/4.0/legalcode. The corresponding...
 
                                             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
     
         
                 
                 
                 
                 
                 
                 
                 
                 
                