280 likes | 374 Views
Quantitative Methods for Researchers. Paul Cairns paul.cairns@york.ac.uk. Objectives. Statistical argument Safe designs A whizz through some stats Time for questions. Statistical Argument. Inference is an argument form Prediction is essential Alternative hypothesis “X causes Y”
E N D
Quantitative Methods for Researchers Paul Cairns paul.cairns@york.ac.uk
Objectives • Statistical argument • Safe designs • A whizz through some stats • Time for questions
Statistical Argument • Inference is an argument form • Prediction is essential • Alternative hypothesis • “X causes Y” • No prediction – measuring noise
Gold standard argument • Collect data • Data variation could be chance (null) • Predictthe variations (alternative) • Statistics give probabilities • Unlikely predictions “prove” your case
Implications • Must have an alt (testable) hyp • No multiple testing • No post hoc analysis • Need multiple experiments
Silver standard argument • Collect data • Data variations could be chance (null) • Are there “real” patterns in the data? • Use statistics to suggest (unlikely) patterns • Follow up findings with gold standard work
Fishing: This is bad science • Collect lots of data • DVs and IVs • Data variations could be chance • Test until a significant result appears • Report the tests that were significant • Claim the result is important
Statistical pit… • … is bottomless! • Safe designs • One (or two) IV • Two (or three) conditions • One primary DV • Other stuff is not severely tested
Choosing a test • What’s the data type? • Do you know the distribution? • Within or between • What are you looking for?
Seeing location • Boxplots • Median, IQR, • “Range” • Outliers
Distributions • Theoretical stance • Must have this! • Not inferred from samples
Parametric tests • Normal distribution • Two parameters • Null = one underlying normal distribution • Differences in location (mean)
t-test • Two samples • Two means • Are means showing natural variation? • Compare difference to natural variation
Effect size • How interesting is the difference? • 2s difference in timings • Significance is not same as importance • Cohen’s d
ANOVA • Parametric • Multiple groups • Why not do pairwise comparison? • Get an F value • Follow up tests
ANOVA++ • Multiple IV • So more F values! • Within and between • Effect size, η2 • Amount of variance predicted by IV
Non-parametric tests • Unknown underlying distribution • Heterogeneity of variance • Non-interval data • Usually test location • Effect size is tricky!
Basic tests • Mann-Whitney • Wilcoxon • Kruskal-Wallis • Friedman • No accepted two-way tests
Choosing a test For your fantasy abstract, what test would you choose? Why? Would you change your design?
Questions • Specific problems • Specific tests • Other tests?
Useful Reading • Cairns, Cox, Research Methods for HCI: chaps 6 • Rowntree, Statistics Without Tears • Howell, Fundamental Statistics for the Behavioural Sciences, 6thedn. • Abelson, Statistics as Principled Argument • Silver, The Signal and the Noise
Multivariate • Multiple DV • Multivariate normal distribution • Normal no matter how you slice • MANOVA • Null = one underlying (mv) normal distribution
Issues • Sample size • Assumptions • Interpretation • Communication
Monte Carlo • Process but not distribution • Generate a really large sample • Compare to your sample • Still theoretically driven!
Example • Event = 4 heads in a row from a set of 20 flips of a coin • You have sample of 30 sets • 18 events • How likely? • Get flipping!