Reader small image

You're reading from  The Applied Data Science Workshop - Second Edition

Product typeBook
Published inJul 2020
Reading LevelIntermediate
PublisherPackt
ISBN-139781800202504
Edition2nd Edition
Languages
Tools
Concepts
Right arrow
Author (1)
Alex Galea
Alex Galea
author image
Alex Galea

Alex Galea has been professionally practicing data analytics since graduating with a masters degree in physics from the University of Guelph, Canada. He developed a keen interest in Python while researching quantum gases as part of his graduate studies. Alex is currently doing web data analytics, where Python continues to play a key role in his work. He is a frequent blogger about data-centric projects that involve Python and Jupyter Notebooks.
Read more about Alex Galea

Right arrow

Approaching Data Science Problems

It's important to ensure you have a well-structured plan for your data science project before you start the analysis and modeling phases. We'll outline some factors to keep in mind when making this plan, and then go over some technical details regarding preparing data for modeling in the next section.

Since this book is centered around Jupyter Notebooks, we'll start by highlighting how useful they are for the planning phase of a data science project. They offer a very convenient medium for documenting your analysis and modeling plans, for example, by writing rough notes about the data or a list of models we are interested in training. Having these notes in the same place as your proceeding analysis can help others understand what you're doing when they see your work or provide context for you when you look back after leaving it for a while.

A large part of data science involves the use of machine learning to build predictive...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
The Applied Data Science Workshop - Second Edition
Published in: Jul 2020Publisher: PacktISBN-13: 9781800202504

Author (1)

author image
Alex Galea

Alex Galea has been professionally practicing data analytics since graduating with a masters degree in physics from the University of Guelph, Canada. He developed a keen interest in Python while researching quantum gases as part of his graduate studies. Alex is currently doing web data analytics, where Python continues to play a key role in his work. He is a frequent blogger about data-centric projects that involve Python and Jupyter Notebooks.
Read more about Alex Galea