Dealing with data
Typically, when you deal with data, this is the path you go through: you fetch it, you clean and manipulate it, then you inspect it and present results as values, spreadsheets, graphs, and so on. I want you to be in charge of all three steps of the process without having any external dependency on a data provider, so we're going to do the following:
- We're going to create the data, simulating the fact that it comes in a format which is not perfect or ready to be worked on. 
- We're going to clean it and feed it to the main tool we'll use in the project: DataFrame of - pandas.
- We're going to manipulate the data in the DataFrame. 
- We're going to save the DataFrame to a file in different formats. 
- Finally, we're going to inspect the data and get some results out of it. 
Setting up the notebook
First things first, we need to set up the notebook. This means imports and a bit of configuration.
#1
import json import calendar import random from datetime import date, timedelta import faker import...
 
                                             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
             
     
         
                 
                 
                 
                 
                 
                 
                 
                 
                