Afsnitsforfatter: Danielle J. Navarro and David R. Foxcroft

The “role” of variables: predictors and outcomes

I have got one last piece of terminology that I need to explain to you before moving away from variables. Normally, when we do some research we end up with lots of different variables. Then, when we analyse our data, we usually try to explain some of the variables in terms of some of the other variables. It is important to keep the two roles “thing doing the explaining” and “thing being explained” distinct. So let us be clear about this now. First, we might as well get used to the idea of using mathematical symbols to describe variables, since it is going to happen over and over again. Let us denote the “to be explained” variable Y, and denote the variables “doing the explaining” as X_1, X_2, etc.

When we are doing an analysis we have different names for X and Y, since they play different roles in the analysis. The classical names for these roles are independent variable (IV) and dependent variable (DV). The IV is the variable that you use to do the explaining (i.e., X) and the DV is the variable being explained (i.e., Y). The logic behind these names goes like this: if there really is a relationship between X and Y then we can say that Y depends on X, and if we have designed our study “properly” then X is not dependent on anything else. However, I personally find those names horrible. They are hard to remember and they are highly misleading because (a) the IV is never actually “independent of everything else”, and (b) if there is no relationship then the DV does not actually depend on the IV. And in fact, because I am not the only person who thinks that IV and DV are just awful names, there are a number of alternatives that I find more appealing. The terms that I will use in this book are predictors and outcomes. The idea here is that what you are trying to do is use X (the predictors) to make guesses about Y (the outcomes).[1] This is summarised in tabel 2.

tabel 2 The terminology used to distinguish between different roles that a variable can play when analysing a data set. Note that this book will tend to avoid the classical terminology in favour of the newer names.

role of the variable

classical name

modern name

“to be explained”

dependent variable (DV)

outcome

“to do the explaining”

independent variable (IV)

predictor