Dataset overview
Let’s explore the dataset we’re about to use. Like we did in other chapters on machine learning, we start with a dataset, this one being a dataset of Amazon book reviews.
The dataset contains information about different products and their reviews. It includes the following columns:
marketplace(string): Location of the productcustomer_id(string): Unique ID of the customerreview_id(string): Review IDproduct_id(string): Unique ID of the productproduct_parent(string): Parent productproduct_title(string): Title of the product reviewedproduct_category(string): Different product categoriesstar_rating(int): Rating of the product out of 5helpful_votes(int): Number of helpful votes for the producttotal_votes(int): Total number of votes for the productreview_headline(string): Heading of the reviewreview_body(string): Content of the reviewreview_date(string...