Inferences From Nominal Data: The Χ2 Statistic

Chapter 22 Inferences From Nominal Data: the Χ2 Statistic

Χ2 test of frequencies • Pronounced “ki - square” • Can be used for nominal level data • data with only the category property • The Χ2 test is based on a comparison of expected frequencies (fe) versus observed frequencies (fo)

Example of frequencies (refresher) • Twenty-five miners were asked about their political affiliation - Republican, Democrat, Independent, None • Since the data, political affiliation, have only the category property, they are nominal level, and can only be counted

Observed Frequencies

What was expected? • Let’s say, that on average, throughout the country, the percentages of people who report the following political party affiliations are: Democrat - 45% Republican - 40% Independent - 10% None - 5%

Is the survey of miners different from the national average? • Computing expected frequencies is then done by multiplying the number of observations with the percent expected: fe = n(% expected)

Expected frequencies On average, then we would expect: Democrats = 25 (.45) = 11.25 Republicans = 25 (.40) = 10 Independents = 25 (.10) = 2.5 None = 25 (.05) = 1.25

Χ2 statistic • Χ2 statistic is calculated using the following formula:

Χ2 –test on miners political affiliation

df in Χ2 test • df = number of categories - 1 • In the present case, 4 categories - 1 = 3

Interpretations of Χ2 statistic • If the differences between the observed and expected frequencies are small, these differences squared will also be small, thus making the sum of these squared differences small also • Thus, small differences between observed and expected = small Χ2 statistic

Interpretations of Χ2 statistic • However, if the differences between observed and expected frequencies are large, then these differences squared will be large also, making the sum of the squared differences large • Thus, large differences = large Χ2statistic

Interpretations of Χ2 statistic • How large is large? • Fortunately, the Χ2 statistic is based on a probability distribution of known parameters • Table G in your text provides critical values for Χ2 tests

Survey of Miners HO : Frequency of Dems = 11.25 Frequency of Reps = 10 Frequency of Ind = 2.5 Frequency of None = 1.25 HA: HO is incorrect Χ2 = .05, Χ2crit .05 (df = 3) = 7.81

Survey of Miners • Since the obtained Χ2 = 11.00 is larger than the critical Χ2 = 7.81, we • Reject HO that the obtained frequencies are 11.25, 10, 2.5, and 1.25 respectively, and • Conclude that the obtained frequencies were different than expected

Inferences From Nominal Data: The Χ2 Statistic

Inferences From Nominal Data: The Χ2 Statistic

Presentation Transcript

Chapter 22

Chapter 22.

CHAPTER 22

Chapter 22

Chapter 22

Chapter 22

Chapter 22

Chapter 22

Chapter 22

Chapter 22

Chapter 22

Chapter 22

(Chapter 22)

Chapter 22

Chapter 22