Finding which variables in a dataset are most “similar”, in some way, to the outcome variable of interest can be a very useful first step in understanding the dataset and planning the next steps. The R package ClustOfVar provides an implementation for this purpose in the mixedVarSim() function.
Check out the full workflow at my portfolio!









