1 / 53

Anita L. Stewart Institute for Health & Aging University of California, San Francisco

Class 10 Creating Scores and Change Scores, Presenting Measurement Data, Selecting Standard Survey Items November 29, 2007. Anita L. Stewart Institute for Health & Aging University of California, San Francisco. Overview of Class 10.

nika
Download Presentation

Anita L. Stewart Institute for Health & Aging University of California, San Francisco

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Class 10Creating Scores and Change Scores, Presenting Measurement Data, Selecting Standard Survey Items November 29, 2007 Anita L. Stewart Institute for Health & Aging University of California, San Francisco

  2. Overview of Class 10 • Creating summated scales and presenting measurement information • Creating and presenting change scores • “The rest of the survey” • Locating standard survey questions

  3. Creating Likert Scale Scores • Translate codebook scoring rules into program code (SAS, SPSS): • Reverse all items that are not already in desired direction (e.g., higher = better) • Average all items • Allows score if 1 item is answered • Apply missing data rule if different • e.g., if more than 50% items missing

  4. Program Statements: Creating Summated Scale Scores • From last week (SAS statements)

  5. Review Summated Scores • Review scores for out-of-range values, outliers, expected mean • For scores with problems, review programming statements, locate errors and correct • Repeat process until computer algorithm is producing accurate scores • To test programming accuracy, calculate scores by hand from 2 questionnaires • Check that they match computer generated scores

  6. Testing Scaling Properties in Your Sample for Multi-Item Scales • Obtain item-scale correlations • Part of reliability program • Each item correlates at least .30 with the total scale (corrected for overlap)

  7. Testing Scaling Properties in Your Sample for Multi-Item Scales (cont) • Calculate internal-consistency reliability (Cronbach’s alpha) for multi-item scales in your sample • Regardless of reliability in other studies • Internal consistency should be at least .70 • If lower, see if deleting items <.30 will improve it

  8. Presenting Measurement Results (Handout) • Present for each final scale: • % missing • Mean, standard deviation • Observed range, possible range • Floor and ceiling effects, skewness statistic • Range of item-scale correlations • Number of item-scale correlations > .30 • Internal consistency reliability

  9. Overview of Class 10 • Creating summated scales and presenting measurement information • Creating and presenting change scores • “The rest of the survey” • Locating standard survey questions

  10. Change Scores are Important Variables! • Creating change score variables is complex • Requires thought ahead of time • Don’t rely on your programmer • Include specification of change scores in your codebook

  11. Three Types of Change Scores • Measured change • Difference in scores between baseline andfollow-up • Percentage change • Measured change as percent of baseline score • Perceived change • How much change respondent reports (from some prior time period)

  12. Measured Change • Difference in scores from baseline to follow-up • Example measure administered at baseline and 1 month after treatment • Pain in past 2 weeks, 0-10 numeric scale,10 = worst pain

  13. Measured Change (cont) • Hypothetical results • Time 1 (baseline) - score of 5 • Time 2 (one month) - score of 8 • How should change be measured?

  14. Measured Change (cont) Time 1 (baseline) - score of 5 Time 2 (one month) - score of 8 • How should change be measured? • Two options: • Time 2 minus time 1 • Time 1 minus time 2

  15. Measured Change (cont) Time 1 (baseline) - score of 5 Time 2 (one month) - score of 8 • Option one: time 2 minus time 1= +3 • Option two: time 1 minus time 2 = -3 • Interpretation of change score?

  16. Interpretation of Change Score • What do you want the change score to indicate? • Positive change score = improving? • Positive change score = worsening? • Scoring thus depends on: • Direction of scores on original measure (is higher score better or worse?) • Which was subtracted from which?

  17. You want positive score = improvement If high score on measure is better Time 2 minus time 1 If high score on measure is worse Time 1 minus time 2 You want positive score = decline If high score on measure is better Time 1 minus time 2 If high score on measure is worse Time 2 minus time 1 Define Change Score Before Calculation: Algorithms

  18. Example: You Want Positive Score To Indicate Improvement • Hypothetical subject: Improved • Subtract score nearest “worst” end from score nearest “best” end (worst) 0 1 2 3 4 5 6 7 8 9 10 (best) time 1 time 2

  19. Example: You Want Positive Score To Indicate Improvement • Subtract score nearest “worst” end from score nearest “best” end (worst) 0 1 2 3 4 5 6 7 8 9 10 (best) time 1 time 2 Time 2 minus time 1 = +4 (improved by 4 points)

  20. Example: You Want Positive Score To Indicate Improvement (Scale Reversed) • Hypothetical subject: Improved • Subtract score nearest “worst” end from score nearest “best” end (best) 0 1 2 3 4 5 6 7 8 9 10 (worst) time 2 time 1

  21. Example: You Want Positive Score To Indicate Improvement (Scale Reversed) • Subtract score nearest “worst” end from score nearest “best” end (best) 0 1 2 3 4 5 6 7 8 9 10 (worst) time 2 time 1 Time 1 minus time 2 = +4 (improved by 4 points)

  22. If high score on measure = better Calculate change score so positive change score = improved Time 2 minus time 1 If high score on measure = worse Calculate change scores so positive change score = improved Time 1 minus time 2 Recommendation: Make Change Score Intuitively Meaningful

  23. Interpreting “Measured Change” Scores: What is Wrong? • In a study predicting utilization of health care (outpatient visits) over a 1-year period as a function of self-efficacy… • A results sentence: • “Reduced utilization at one year was associated with level of self efficacy at baseline (p < .01) and with 6-month changes in self efficacy (p < .05).”

  24. Interpreting “Measured Change” Scores: Making it Clearer • “Reduced outpatient visits at one year were associated with lower levels of self efficacy at baseline (p < .01) and with 6-month improvements in self efficacy.” • Old way: • “Reduced utilization at one year was associated with level of self efficacy at baseline (p < .01) and with 6-month changes in self-efficacy.”

  25. Three Types of Change Scores • Measured change • Difference in scores between baseline andfollow-up • Percentage change • Measured change as percent of baseline score • Perceived change • How much change respondent reports (from some prior time period)

  26. Presenting Change Scores in Tables: What is Wrong? • Change in anxiety over a 1-year period for two groups 1 year change in anxiety p Exercise group - 40 < .001 Education group +4 ns

  27. Presenting Change Scores in Tables: Making it Clearer • Change in anxiety over a 1-year period for two groups 1 year change in anxiety p Exercise group - 40 < .001 Education group +4 ns *Negative score indicates decreased anxiety (change scores are 1-year minus baseline)

  28. Reliability of Change Score • Difference scores have been criticized as having low reliability • Nunnally (1994) considers alternatives and suggests this may not be as large a problem as previously thought (p. 247) Nunnally JC and Bernstein IH. Psychometric Theory, Third Edition, McGraw-Hill, New York, 1994.

  29. Percentage Change • Measured change divided by baseline score • Example: pain measure, higher is more pain • change score of -2, baseline score of 6 • 2/6 = 33% reduction in pain

  30. Example of Percentage Change Problem with Likert Scales • You want a positive change to indicate improvement (and high score is better) • Subtract score nearest “worst” end from score nearest “best” end (worst) 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 (best) time 1 time 2 Time 2 minus Time 1 = change of +4 (improved by 4 points) Change of 4 / baseline score of 8 = 50% improvement

  31. Example of Percentage Change Problem with Likert Scales (cont.) • You want a positive change to indicate improvement • high score is worse • Subtract score nearest “best” end from score nearest “worst” end (best) 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 (worst) time 2 time 1 Time 1 minus Time 2 = change of +4 (improved by 4 points) Change of 4 / baseline score of 16 = 25% improvement

  32. Percentage Change Scores Only Work for Ratio-Level Measures • Can do percentage change only on scales with a true zero • zero represents the absence of the trait in question • Ratio scores - weight in pounds • Person weighs 150 pounds • Gains 10, gained 15% of original weight • Loses 10, lost 15% of original weight

  33. Three Types of Change Scores • Measured change • Difference in scores between baseline andfollow-up • Percentage change • Measured change as percent of baseline score • Perceived change • How much change respondent reports (from some prior time period)

  34. Perceived Change (Retrospective Change) • How much has your physical functioning changed since your surgery? 1 - very much worse 2 - much worse 3 - worse 4 - no change 5 - better 6 - much better 7 - very much better

  35. Perceived Change (Retrospective Change) – Better Response Choice? • How much has your physical functioning changed since your surgery? -3 Very much worse -2 Much worse -1 Worse 0 No change 1 Better 2 Much better 3 Very much better

  36. Perceived/Retrospective Change • Perceived change enables respondent to define physical functioning in terms of what it means to them • Measured change is a change on specific questions that were contained in the particular measure

  37. Example of Measured Change • Baseline and 6-month limitations: • Difficulty walking • Difficulty climbing stairs • Measured change: change on these 2 physical functions • If person had no change walking or climbing stairs • Score would be “no change”

  38. Example of Perceived Change • To what extent did your physical functioning change over the past 6 months? • Much worse • Worse • No change • Better • Much better • If person has more trouble bending over, and considers this as part of physical functioning, they will report becoming worse

  39. Perceived/Retrospective Change • Recommend including both types of measures to assess change • Measured change enables • Comparison with other studies • May be more sensitive because has more scale levels (if multi-item measure) • Investigator defines clinically relevant outcomes • Perceived/Retrospective change enables • Person to report on domain using their own definition • Picks up changes “unmeasured” by particular measure

  40. Overview of Class 10 • Creating summated scales and presenting measurement information • Creating and presenting change scores • “The rest of the survey” • Locating standard survey questions

  41. Locating “Standard” Survey Questions • MD characteristics • Comorbidity, chronic conditions • Medical history, family history • Health behaviors

  42. Demographics – Just About Everywhere • Basic demographics • Socioeconomic status • Financial information (assets, income, wealth) • Employment, occupation • Retirement • Health insurance

  43. Take Away Point: • Don’t write these yourself • Use standard questions from appropriate existing surveys

  44. National and State Surveys • Population surveys • Tend to have single-item measures rather than multi-item scales • Good for “standardized” survey items

  45. State Surveys • http://www.chis.ucla.edu/ • California Health Interview Survey (CHIS) • “Questionnaires” • See contents of 2006 CHIS: adults and adolescents

  46. National Surveys • Behavioral Risk Factor Surveillance System Questionnaires • http://www.cdc.gov/brfss/questionnaires/questionnaires.htm • See contents of 2006 BRFSS • English and Spanish

  47. MacArthur Research Network on Socioeconomic Status and Health • Measures of economic status, occupational status, education, and perceived social status • Includes rationale • http://www.macses.ucsf.edu/Research/Social%20Environment/notebook/economic.html • Also basic demographics

  48. Center for Aging in Diverse Communities (CADC) • Recommends items measuring socioeconomic status • Education, income, race/ethnicity, place of birth/generation, English language proficiency, financial hardship • Main website: http://medicine.ucsf.edu/cadc/cores/measurement/index.html

  49. Cancer Research Measures • The Division of Cancer Epidemiology and Genetics • Demographics, medical history, family history, other risk factors http://dceg.cancer.gov/QMOD/

  50. Non-English Language? • California Health Interview Survey • Numerous languages • Spanish language surveys • SALSA • Hispanic Health and Nutrition Examination Survey (HHANES) • National Mexican Health and Aging Study • Behavioral Risk Factor Surveillance System (CDC)

More Related