Introduction to Data Science and Data Pre-Processing
Learning Objectives
By the end of this chapter, you will be able to:
- Use various Python machine learning libraries
 - Handle missing data and deal with outliers
 - Perform data integration to bring together data from different sources
 - Perform data transformation to convert data into a machine-readable form
 - Scale data to avoid problems with values of different magnitudes
 - Split data into train and test datasets
 - Describe the different types of machine learning
 - Describe the different performance measures of a machine learning model
 
This chapter introduces data science and covers the various processes included in the building of machine learning models, with a particular focus on pre-processing.