Autor des Abschnitts: Danielle J. Navarro and David R. Foxcroft

Sonstige Themen

Fehlende Daten. Angenommen, Sie führen eine Umfrage durch und interessieren sich für Bewegung und Gewicht. Sie senden Daten an vier Personen. Adam sagt, er treibe viel Sport und sei nicht übergewichtig. Briony sagt, sie treibe viel Sport und sei nicht übergewichtig. Carol sagt, sie treibe keinen Sport und sei übergewichtig. Tim sagt, er treibe keinen Sport und weigert sich, die Frage nach seinem Gewicht zu beantworten. Elaine sendet die Umfrage nicht zurück. Sie haben nun ein Problem mit fehlenden Daten. Es fehlt eine ganze Umfrage und eine Frage in einer anderen Umfrage. Was tun Sie? Das Ignorieren fehlender Daten ist im Allgemeinen keine gute Idee. Betrachten wir hier die Umfrage von Tim. Zunächst fällt auf, dass er aufgrund seiner anderen Antworten mehr Ähnlichkeit mit Carol (beide trainieren nicht) als mit Adam oder Briony zu haben scheint. Wenn Sie also gezwungen wären, sein Gewicht zu schätzen, würden Sie vermuten, dass er Carol ähnlicher ist als den beiden Anderen. Vielleicht würden Sie die Tatsache korrigieren, dass Adam und Tim männlich und Briony und Carol weiblich sind. Der statistische Name für diese Art des Ratens ist „Imputation“. Das sichere Durchführen von Imputationen ist schwierig, aber wichtig, vor allem, wenn die fehlenden Daten systematisch fehlen. Da übergewichtige Menschen sich oft unter Druck gesetzt fühlen, wegen ihres Gewichts besorgt zu sein, haben wir Grund zu der Annahme, dass die Personen, die nicht antworten, mit größerer Wahrscheinlichkeit übergewichtig sind als die Personen, die antworten. Wenn wir Tim ein Gewicht zuschreiben, bedeutet dies, dass die Zahl der übergewichtigen Personen in der Stichprobe wahrscheinlich von 1 von 3 (wenn wir Tim ignorieren) auf 2 von 4 (wenn wir Tims Gewicht zuschreiben) ansteigt. Das ist natürlich wichtig. Aber eine sinnvolle Vorgehensweise ist komplizierter als es klingt. Vorhin habe ich vorgeschlagen, Tim wie Carol zu behandeln, da beide die gleiche Antwort auf die Übungsfrage gegeben haben. Aber das ist nicht ganz richtig. Es gibt einen systematischen Unterschied zwischen den beiden. Sie hat die Frage beantwortet, Tim aber nicht. Ist es angesichts des sozialen Drucks, dem übergewichtige Menschen ausgesetzt sind, nicht wahrscheinlich, dass Tim stärker übergewichtig ist als Carol? Und natürlich wird dabei immer noch die Tatsache ignoriert, dass es nicht sinnvoll ist, Tim einen festen Wert für sein Gewicht zuzuschreiben, so als ob man sein Gewicht tatsächlich kennen würde. Stattdessen müssen Sie eine Reihe von plausiblen Schätzungen unterstellen (als multiple Imputation bezeichnet), um der Tatsache Rechnung zu tragen, dass Sie über das Gewicht von Tim unsicherer sind als über das von Carol. Und lassen Sie uns gar nicht erst mit dem Problem anfangen, das sich aus der Tatsache ergibt, dass Elaine die Umfrage nicht beantwortet hat. Wie Sie sich wahrscheinlich denken können, ist der Umgang mit fehlenden Daten ein wichtiges Thema. Man hat mir sogar gesagt, dass viele Zeitschriften in einigen Bereichen Studien mit fehlenden Daten nur dann akzeptieren, wenn ein vernünftiges Schema für die mehrfache Imputation befolgt wird.
Power analysis. In chapter Das Überprüfen von Hypothesen I discussed the concept of power (i.e., how likely are you to be able to detect an effect if it actually exists) and referred to power analysis, a collection of tools that are useful for assessing how much power your study has. Power analysis can be useful for planning a study (e.g., figuring out how large a sample you are likely to need), but it also serves a useful role in analysing data that you already collected. For instance, suppose you get a significant result, and you have an estimate of your effect size. You can use this information to estimate how much power your study actually had. This is kind of useful, especially if your effect size is not large. For instance, suppose you reject the null hypothesis at p < 0.05, but you use power analysis to figure out that your estimated power was only 0.08. The significant result means that, if the null hypothesis was in fact true, there was a 5% chance of getting data like this. But the low power means that, even if the null hypothesis is false and the effect size was really as small as it looks, there was only an 8% chance of getting data like you did. This suggests that you need to be pretty cautious, because luck seems to have played a big part in your results, one way or the other!
Data analysis using theory-inspired models. In a few places, I have mentioned response time (RT) data, where you record how long it takes someone to do something (e.g., make a simple decision). I have mentioned that RT data are almost invariably non-normal, and positively skewed. Additionally, there is a thing known as the speed-accuracy trade-off: if you try to make decisions too quickly (low RT) then you are likely to make poorer decisions (lower accuracy). So if you measure both the accuracy of a participant’s decisions and their RT, you will probably find that speed and accuracy are related. There is more to the story than this, of course, because some people make better decisions than others regardless of how fast they are going. Moreover, speed depends on both cognitive processes (i.e., time spent thinking) but also physiological ones (e.g., how fast can you move your muscles). It is starting to sound like analysing this data will be a complicated process. And indeed it is, but one of the things that you find when you dig into the psychological literature is that there already exist mathematical models (called “sequential sampling models”) that describe how people make simple decisions, and these models take into account a lot of the factors I mentioned above. You will not find any of these theoretically-inspired models in a standard statistics textbook. Standard textbooks describe standard tools, tools that could meaningfully be applied in lots of different disciplines, not just psychology. ANOVA is an example of a standard tool that is just as applicable to psychology as to pharmacology. Sequential sampling models are not, they are psychology-specific, more or less. This does not make them less powerful tools. In fact, if you are analysing data where people have to make choices quickly you should really be using sequential sampling models to analyse the data. Using ANOVA or regression or whatever will not work as well, because the theoretical assumptions that underpin them are not well-matched to your data. In contrast, sequential sampling models were explicitly designed to analyse this specific type of data, and their theoretical assumptions are extremely well-matched to the data.