Bivariate analysis
In this section, we will cover bivariate analysis, which examines how two variables behave together and how one variable affects the other. In any real-life data set, several variables depend on each other, so this analysis helps us understand those relationships.
The quickest way to understand the relationship between two variables is a scatter plot. This visual representation gives us a clear idea of how one variable changes with the other. We can use the same ggplot function to draw it. We will plot a scatter chart to see the relationship between the Age and Fare variables:
ggplot(tdata, aes(x=Fare, y=Age)) + geom_point(shape=1) + geom_smooth(method=lm)
ggsave(file="scatter-plot.png", dpi=500)
In the preceding case, we plot the relationship between these two variables on top of the scatter plot using the geom_smooth layer, which, with method=lm, adds a fitted straight line that...
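The plotting call above assumes that tdata already exists. The following is a minimal, self-contained sketch of the same idea, not the book's exact code. It builds a small synthetic stand-in for tdata (the values are randomly generated and purely illustrative), stores the plot in a variable named p, and passes it to ggsave explicitly. The width and height values, the missing-value handling, and the cor and lm calls at the end are additions for illustration and are not part of the original example.

```r
library(ggplot2)

# Hypothetical stand-in for tdata: synthetic values, NOT the real data set.
set.seed(42)
tdata <- data.frame(
  Age  = round(runif(100, min = 1, max = 80)),
  Fare = round(rexp(100, rate = 1 / 30), 2)
)
tdata$Age[c(5, 17)] <- NA  # simulate a few missing ages

# Build the plot as an object so it can be handed to ggsave() explicitly.
p <- ggplot(tdata, aes(x = Fare, y = Age)) +
  geom_point(shape = 1, na.rm = TRUE) +   # shape 1 draws hollow circles
  geom_smooth(method = "lm", formula = y ~ x, na.rm = TRUE)  # straight-line fit

# Width and height (in inches) are assumed values chosen for this sketch.
ggsave(filename = "scatter-plot.png", plot = p, width = 7, height = 5, dpi = 500)

# Numeric companions to the picture: correlation and the fitted line.
r   <- cor(tdata$Fare, tdata$Age, use = "complete.obs")  # skip rows with NA
fit <- lm(Age ~ Fare, data = tdata)  # lm() drops NA rows by default
coef(fit)  # intercept and slope of the fitted line
```

Passing plot = p to ggsave avoids relying on whichever plot happened to be displayed last, which matters when the code runs in a script rather than interactively. The correlation coefficient and the lm coefficients give a numeric summary of the same linear trend that geom_smooth draws.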