Correlation
In this section, we will cover correlation – what does correlation mean, and how do we check the correlation between the DEFAULT
column and other columns in our dataset?
Correlation measures the degree of dependency between any two variables. Say, for example, we have two variables, A and B. If the value of B increases when the value of A is increased, we say the variables are positively correlated. On the other hand, if the value of B decreases when we increase the value of A, we say the variables are negatively correlated. There could also be a situation where an increase in the value of A doesn't affect the value of B, for which we say the variables are uncorrelated.
The value of a correlation coefficient can vary between -1 to 1, with 1 being a strong positive correlation and -1 a strong negative correlation.
By studying the correlation between the DEFAULT
column and other columns with the help of a heatmap, we can figure out which column/variable...