310 likes | 417 Views
Quantitative Methods for Researchers. Paul Cairns paul.cairns@york.ac.uk. Objectives. Statistical argument Comparison of distributions A fly-by of approaches. How are the abstracts?. Questions? Problems? Restarts?. Statistical Argument. Inference is an argument form
E N D
Quantitative Methods for Researchers Paul Cairns paul.cairns@york.ac.uk
Objectives • Statistical argument • Comparison of distributions • A fly-by of approaches
How are the abstracts? • Questions? • Problems? • Restarts?
Statistical Argument • Inference is an argument form • Prediction is essential • Alternative hypothesis • “X causes Y” • No prediction – measuring noise
Gold standard argument • Collect data • Data variation could be chance (null) • Predict the variations (alternative) • Statistics give probabilities • Unlikely predictions “prove” your case
Implications • Must have an alt hyp • No multiple testing • No post hoc analysis • Need multiple experiments
Silver standard argument • Collect data • Data variations could be chance (null) • Are there “real” patterns in the data? • Use statistics to suggest (unlikely) patterns • Follow up findings with gold standard work
Fishing: This is bad science • Collect lots of data • DVs and IVs • Data variations could be chance • Test until a significant result appears • Report the tests that were significant • Claim the result is important
Statistical inference • Model comparison: • Single distribution (null) • Multiple distributions (alternative) • From samples, which model is better? • From samples, is null likely?
What terms do you know? • The statistical zoo!
Choosing a test • What’s the data type? • Do you know the distribution? • Within or between • What are you looking for?
Distributions • Theoretical stance • Must have this! • Not inferred from samples
Parametric tests • Normal distribution • Two parameters • Null = one underlying normal distribution • Differences in location (mean)
t-test • Two samples • Two means • Are means showing natural variation? • Compare difference to natural variation
Effect size • How interesting is the difference? • 2s difference in timings • Significance is not same as importance • Cohen’s d
ANOVA • Parametric • Multiple groups • Why not do pairwise comparison? • Get an F value • Follow up tests
ANOVA++ • Multiple IV • So more F values! • Within and between • Effect size, η2 • Amount of variance predicted by IV
Non-parametric tests • Unknown underlying distribution • Heterogeneity of variance • Non-interval data • Usually test location • Effect size is tricky!
Wilcoxon test • See sheet
Seeing location • Boxplots • Median, IQR, • “Range” • Outliers
Multivariate • Multiple DV • Multivariate normal distribution • Normal no matter how you slice • MANOVA • Null = one underlying (mv) normal distribution
Issues • Sample size • Assumptions • Interpretation • Communication
Your abstract • What sort of data will you produce? • Can you theorise about the distribution? • What sort of test do you think you will need?
Health warnings • Craft skill • Simpler is better • Doing it • Interpreting it • Communicating it • Experiments as evidence • Software packages are deceptively easy
Q & A • Any question about any aspect • Very general or very specific • Any research method!
Useful Reading • Cairns, Cox, Research Methods for HCI: chaps 6 • Rowntree, Statistics Without Tears • Howell, Fundamental Statistics for the Behavioural Sciences, 6thedn. • Abelson, Statistics as Principled Argument • Silver, The Signal and the Noise
Monte Carlo • Process but not distribution • Generate a really large sample • Compare to your sample • Still theoretically driven!
Example • Event = 4 heads in a row from a set of 20 flips of a coin • You have sample of 30 sets • 18 events • How likely? • Get flipping!