Selecting the features for classifying the Zoo dataset
The UCI Machine Learning Repository (https://archive.ics.uci.edu/) maintains over 600 datasets as a service to the machine learning community. These datasets can be used to experiment with various models and algorithms. A typical dataset is organized in columns: a number of features (inputs) and the desired output, accompanied by a description of what each column means.
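To make the column layout concrete, here is a minimal sketch of separating feature columns from the output column using only the Python standard library. The data, the column names (`has_wheels`, `num_seats`, `label`), and the helper `split_features_and_output` are all hypothetical, made up for illustration; they are not taken from any UCI dataset.

```python
import csv
import io

# Hypothetical toy data, invented for illustration only.
# The last column is the desired output; the others are features.
RAW = """has_wheels,num_seats,label
1,2,vehicle
0,4,furniture
1,5,vehicle
"""


def split_features_and_output(text, output_column):
    """Parse CSV text and separate the feature columns from the output column."""
    reader = csv.DictReader(io.StringIO(text))
    # Every column except the output column is treated as a feature.
    feature_names = [name for name in reader.fieldnames if name != output_column]
    features, outputs = [], []
    for row in reader:
        features.append([row[name] for name in feature_names])
        outputs.append(row[output_column])
    return feature_names, features, outputs


feature_names, X, y = split_features_and_output(RAW, "label")
print(feature_names)  # names of the feature columns
print(X)              # one list of feature values (as strings) per row
print(y)              # the desired output for each row
```

The values are kept as strings here; a real workflow would typically convert them to numeric or boolean types according to the dataset's column descriptions.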
In this section, we will use the UCI Zoo dataset (https://archive.ics.uci.edu/dataset/111/zoo). This dataset describes 101 different animals using the following 18 features:
| No. | Feature Name | Data Type |
| --- | --- | --- |
| 1 | animal name | unique for each instance |
| 2 | hair | boolean |
| 3 | feathers ... |