1 / 18

Conducting a User Study

Conducting a User Study. Human-Computer Interaction. Overview. Why run a study? Evaluate if a statement is true Ex. The heavier a person weighs, the higher their blood pressure Many ways to do this: Look at data from a doctor’s office What’s the pros and cons?

flynn
Download Presentation

Conducting a User Study

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Conducting a User Study Human-Computer Interaction

  2. Overview • Why run a study? • Evaluate if a statement is true • Ex. The heavier a person weighs, the higher their blood pressure • Many ways to do this: • Look at data from a doctor’s office • What’s the pros and cons? • Get a group of people to get weighed and measure their BP • What’s the pros and cons? • Ideal solution: have everyone in the world get weighed and BP • Participants are a sample of the population • You should immediately question this! • Restrict population

  3. Population Design • Identify the statement to be evaluated • Ex. The heavier a person weighs, the higher their blood pressure • Create a hypothesis • Ex. Weight is directly proportional to blood pressure • Identify Independent and Dependent Variables • Independent Variable – the variable that is being manipulated by the experimenter (weight) • Dependent Variable – the variable that is caused by the independent variable. (blood pressure) • Design Study • Invite 100 people • Weigh them and take their BP • Graph • See if there is a trend

  4. Two Group Design • Identify the statement to be evaluated • Ex. Shorter people are smarter than taller people • Create a hypothesis • Ex. IQ of people shorter than 5’9” > IQ of people 5’9” or taller • Design Study • Two groups called conditions • How many people? • What’s your design? • What is the independent and dependent variables? • Confounding factors – factors that affect outcomes, but are not related to the study

  5. Design • External validity – do your results mean anything? • Results should be similar to other similar studies • Use accepted questionnaires, methods • Power – how much meaning do your results have? • The more people the more you can say that the participants are a sample of the population • Generalization – how much do your results apply to the true state of things

  6. Design • People who use a mouse and keyboard will be faster to fill out a form than keyboard alone. • Let’s create a study design • Two types: • Between Subjects • Across Subjects • Everyone do this now for your study

  7. Procedure • Formally have all participants sign up for a time slot (if individual testing is needed) • Informed Consent (let’s look at one) • Execute study • Questionnaires/Debriefing (let’s look at one)

  8. Hypothesis Proving • Hypothesis: • People who use a mouse and keyboard will be faster to fill out a form than keyboard alone. • US Court system: Innocent until proven guilty • NULL Hypothesis: Assume people who use a mouse and keyboard will fill out a form than keyboard alone in the same amount of time • Your job to prove differently! • Alternate Hypothesis 1: People who use a mouse and keyboard will fill out a form than keyboard alone, either faster or slower. • Alternate Hypothesis 2: People who use a mouse and keyboard will fill out a form than keyboard alone, faster.

  9. Analysis • Most of what we do involves: • Normal Distributed Results • Independent Testing • Homogenous Population

  10. Raw Data • What does the mean (average) tell us? Is that enough?

  11. Small Pattern (seconds) Large Pattern (seconds) Mean S.D. Min Max Mean S.D. Min Max Real Space (n=41) 16.81 6.34 8.77 47.37 37.24 8.99 23.90 57.20 Purely Virtual (n=13) 47.24 10.43 33.85 73.55 116.99 32.25 70.20 192.20 Hybrid (n=13) 31.68 5.65 20.20 39.25 86.83 26.80 56.65 153.85 Vis Faith Hybrid (n=14) 28.88 7.64 20.20 46.00 72.31 16.41 51.60 104.50 Variances • standard deviation – measure of dispersion (square root of the sum of squares divided by N)

  12. Small Pattern (seconds) Large Pattern (seconds) Mean S.D. Min Max Mean S.D. Min Max Real Space (n=41) 16.81 6.34 8.77 47.37 37.24 8.99 23.90 57.20 Purely Virtual (n=13) 47.24 10.43 33.85 73.55 116.99 32.25 70.20 192.20 Hybrid (n=13) 31.68 5.65 20.20 39.25 86.83 26.80 56.65 153.85 Vis Faith Hybrid (n=14) 28.88 7.64 20.20 46.00 72.31 16.41 51.60 104.50 Hypothesis • We assumed the means are “equal” • But are they? Or is the difference due to chance?

  13. T - test • T – test – statistical test used to determine whether two observed means are statistically different

  14. T – test • (rule of thumb) Good values of t > 1.96 • Look at what contributes to t • http://socialresearchmethods.net/kb/stat_t.htm

  15. F statistic, p values • F statistic – assesses the extent to which the means of the experimental conditions differ more than would be expected by chance • t is related to F statistic • Look up a table, get the p value. Compare to α • α value – probability of making a Type I error (rejecting null hypothesis when really true) • p value – statistical likelihood of an observed pattern of data, calculated on the basis of the sampling distribution of the statistic. (% chance it was due to chance)

  16. Small Pattern Large Pattern t – test with unequal variance p – value t – test with unequal variance p - value PVE – RSE vs. VFHE – RSE 3.32 0.0026** 4.39 0.00016*** PVE – RSE vs. HE – RSE 2.81 0.0094** 2.45 0.021* VFHE – RSE vs. HE – RSE 1.02 0.32 2.01 0.055+

  17. Significance • What does it mean to be significant? • You have some confidence it was not due to chance. • But difference between statistical significance and meaningful significance • Always know: • samples (n) • p value • variance/standard deviation • means

  18. IRB • http://irb.ufl.edu/irb02/index.html • Let’s look at a completed one • You MUST turn one in by October 28th to the TA! • Must have OKed before running study

More Related