Chapter 4

Chapter 4 Calculating and Evaluating Validity

When We Have Validity . . . We have an acceptably accurate measurement. We are measuring what we intend to measure. We are interpreting and applying the measurement appropriately.

Methods for Evaluating Validity and Their Calculations

Population and Sample Population: The designated group being measured Sample: A representative subgroup of the population Inferential statistics: Generalization about a population based on what is learned about the sample

Using t-tests to Compare Sample Means Purpose of t-tests: Compare two sample means Determine if a sample represents a population Types: Paired vs. independent One-tailed vs. two-tailed

Results of a t-testfor the VO2max-12-minute run example

Interpreting t-tests p value: The probability that the difference between two means is a coincidence of random sampling Sometimes a t-test comparison of means doesn’t give us everything we need to know; we need to compare individual differences in scores.

Correlations Defined: Measurement of the strength of the relationship between two variables. Pearson Product-Moment Correlation: Compares individual differences between two methods of measurement

Results of a PearsonProduct-Moment Correlationfor the VO2max-12-minute run example

Methods for Evaluating a Correlation r value (or correlation coefficient) Trendline R2 (or coefficient of determination)

Slope of a Trendline Defined: The change in y value per unit change in x value. y value = the slope * the x value + the y intercept (point where y = 0) Or y = mx + b

Scatter Plot Showing Trendline

Bland-Altman Analysis A simple approach for evaluating criterion validity—through error analysis: Data taken from an unusually diverse group of subjects will show an artificially high correlation. Analysis focuses attention on error scores.

Spreadsheet for a Bland-Altman Analysis

Bland-Altman Analysis

Evaluating Validity of Ranked Data Ordinal numbers: Ranked numbers that give a place in line but no information about distances between numbers; place holders. Interval numbers: Numbers that are separated by equal intervals; scalar numbers.

Spearman’s rho A special correlation used in cases where one variable is an ordinal number Calculating Spearman’s rho: Excel doesn’t offer the calculation Some websites can do it; for example, www.wessa.net/rankcorr.wasp

Evaluating the Validity of Criterion-Referenced Tests Validity ratio: The ratio of number of scores classified correctly to the total number of scores. Validity ratio = (CR+NCR)/(CR+CW+NCR) where CR = competent and classified correctly NCW = not competent and wrongly classified CW = competent but wrongly classified and NCR = not competent and classified correctly

Example Diagram for Evaluating the Validity of a Criterion-Referenced Measurement

Your Viewpoint After reading the chapter, go back and read the chapter-opening quote from Henry Clay: “Statistics are no substitute for human judgment.” Think also about the famous quote, “There are three kinds of lies: lies, damned lies, and statistics.” Do you agree with the sentiments expressed by these authors? Why or why not?

Chapter 4

Chapter 4

Presentation Transcript

Chapter 4

Chapter 4

Chapter 4

Chapter 4

Chapter 4

Chapter 4

Chapter 4

Chapter 4-4

Chapter 4

Chapter 4

Chapter 4 - 4

Chapter 4

CHAPTER 4

Chapter 4

Chapter 4

CHAPTER 4

Chapter 4

Chapter 4

CHAPTER 4

Chapter 4

Chapter 4

Chapter 4