Crosstabs

jorryn

Presentation Transcript


  1. Crosstabs • How do we assess the relationship between two variables? (We’ll bring in more variables later.) There are various ways, especially with interval-level data; one of the most common is the crosstab. • “Crosstab” is a contraction of “cross tabulation.” It is also called a contingency table. • What is a (simple) crosstab? A table based on two variables, where each cell entry is the count or percentage of cases that fall in both that row’s category and that column’s category. So it is also called a bivariate frequency table.
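
A minimal sketch of what building a crosstab amounts to, assuming a tiny made-up file of six people (the `observations` list, the labels, and the `build_crosstab` helper are all hypothetical, not the class data). Each cell simply counts the cases that share one row category and one column category:

```python
from collections import Counter

# Hypothetical observations: one (gender, watches_program) pair per person.
observations = [
    ("Woman", "Yes"), ("Woman", "Yes"), ("Woman", "No"),
    ("Man", "Yes"), ("Man", "No"), ("Man", "No"),
]

# Count how many cases fall in each (gender, watching) combination.
cell_counts = Counter(observations)

row_labels = ["Yes", "No"]      # watching goes in the rows
col_labels = ["Woman", "Man"]   # gender goes in the columns


def build_crosstab(counts, rows, cols):
    """Return a nested dict where table[row][col] is the number of cases."""
    return {r: {c: counts[(c, r)] for c in cols} for r in rows}


table = build_crosstab(cell_counts, row_labels, col_labels)

# Print one line per row category, with counts in column order.
for r in row_labels:
    print(r, [table[r][c] for c in col_labels])
```

With a thousand cases the logic is the same; only the tallying gets tedious, which is why software does it.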

  2. WARNING: • There’s going to be a lot coming at you (in class). • It requires paying attention, thinking (wow!). • But—it only involves percentages, so no complicated “statistics” (yet).

  3. An example: gender & TV watching • We begin with the observations (persons). A file of about a thousand people would have data like this (except that Gender and Watching TV would be coded using numbers). • You would create the crosstab, presumably using SPSS (these are real bears to do by hand).

  4. Are Women More Likely to Watch Than Men? • You might want to ask the question: are women more likely to watch this particular tv program than men are? • So, you display the data in a crosstab (in this case a 2x2 table). • But how do you read it?

  5. Almost without exception, you want to look at percentages rather than numbers of cases. • But which way to percentage? • Add to 100% within categories of the IV (independent variable). • In the way that makes sense for the question at hand. (Requires thinking.)
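
To make the "add to 100% within categories of the independent variable" rule concrete, here is a small sketch. The counts are invented for illustration, and `percentage_within` is a hypothetical helper name; the point is only that each gender's percentages sum to 100:

```python
# Hypothetical cell counts, organized as counts[gender][watching].
counts = {
    "Women": {"Watch": 120, "Don't watch": 80},
    "Men": {"Watch": 90, "Don't watch": 110},
}


def percentage_within(counts_by_iv):
    """Percentage each IV category so its DV categories add to 100%."""
    result = {}
    for iv_category, dv_counts in counts_by_iv.items():
        n = sum(dv_counts.values())  # number of cases in this IV category
        result[iv_category] = {dv: 100 * k / n for dv, k in dv_counts.items()}
    return result


pct = percentage_within(counts)

# Whole numbers are plenty for a table like this.
for gender, row in pct.items():
    print(gender, {dv: round(p) for dv, p in row.items()})
```

Comparing the "Watch" percentage for women with the one for men is then a direct answer to the question of who is more likely to watch.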

  6. We can percentage either way: by rows or by columns.

  7. Note that I’ve percentaged the total row or column. This is not necessary, but it is often useful, and SPSS does it automatically. • I’ve used whole numbers. Nothing about creating tables dictates the level of precision. Don’t overdo it.

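  8. Which way is correct? • Recall: the DV (dependent variable) is the one we’re trying to explain. The IV (independent variable) is the one used to explain the DV. • The question we are asking is: are women more likely to watch this particular TV program than men are? • So gender is the IV and watching is the DV, and the percentages should add to 100% within each gender.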

  9. Burning question • Does it make any difference which variable makes up the rows, and which the columns? • No. There’s no agreed-upon convention for whether the IV goes in the rows or the columns (text notwithstanding). • But if you switch the row and column variables, then which percentages are right (for your question) will also switch.
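
A sketch of the row/column point, using made-up counts and hypothetical helper names. With the IV (gender) in the columns you percentage down the columns; after swapping rows and columns, the same numbers come from percentaging across the rows:

```python
# Hypothetical counts: gender (IV) in the columns, watching (DV) in the rows.
row_labels = ["Watch", "Don't watch"]
col_labels = ["Women", "Men"]
table = [
    [120, 90],   # Watch
    [80, 110],   # Don't watch
]


def column_percentages(t):
    """Each column adds to 100%."""
    col_totals = [sum(col) for col in zip(*t)]
    return [[100 * cell / col_totals[j] for j, cell in enumerate(row)] for row in t]


def row_percentages(t):
    """Each row adds to 100%."""
    return [[100 * cell / sum(row) for cell in row] for row in t]


def transpose(t):
    """Swap the row and column variables."""
    return [list(col) for col in zip(*t)]


# IV in the columns: percentage down the columns.
by_column = column_percentages(table)

# IV now in the rows: percentage across the rows instead.
swapped = transpose(table)
by_row_swapped = row_percentages(swapped)

print(by_column)
print(by_row_swapped)
```

The layout is a matter of taste; the direction of percentaging has to follow the IV wherever it ends up.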

  10. More analytical matters. Illustration: Do different careers attract different partisans? (class survey) • Are Democrats or Republicans more likely to go into: • Law? • Politics? • Business? • Academia?

  11. Party ID * Career • Recode? Note the small number of cases in some rows. Also, are Independents and Others different? What about the DKs (“don’t knows”): delete them or combine them with other rows?

  12. More analytical matters (cont.) • These are analytical matters. Don’t make meaningless combinations just because of small N’s. Keep in/delete “don’t knows” depending on your reasoning about them. • Suppose we decide to keep DKs and combine the three smallest categories.

  13. Recoded • Table is simpler, easier to read. • More meaningful because it doesn’t make distinctions we aren’t really interested in.
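
One way a recode like this could be sketched, assuming invented counts (not the class survey) and an analyst-chosen mapping. Which three categories were actually combined is not shown in the transcript, so the `recode` mapping below is only a guess for illustration, and `recode_rows` is a hypothetical helper:

```python
from collections import defaultdict

# Hypothetical counts, organized as counts[party][career]; not the class data.
counts = {
    "Democrat": {"Law": 12, "Politics": 9, "Business": 6, "Academia": 8},
    "Republican": {"Law": 10, "Politics": 5, "Business": 11, "Academia": 3},
    "Independent": {"Law": 2, "Politics": 1, "Business": 2, "Academia": 1},
    "Other": {"Law": 1, "Politics": 0, "Business": 1, "Academia": 1},
    "Don't know": {"Law": 1, "Politics": 1, "Business": 0, "Academia": 0},
}

# Analyst-chosen mapping (an assumption): the choice should rest on reasoning
# about the categories, not only on their small N's.
recode = {
    "Independent": "Ind./Other/DK",
    "Other": "Ind./Other/DK",
    "Don't know": "Ind./Other/DK",
}


def recode_rows(table, mapping):
    """Merge row categories by summing their cell counts."""
    merged = defaultdict(lambda: defaultdict(int))
    for row_label, cells in table.items():
        new_label = mapping.get(row_label, row_label)  # unmapped rows keep their label
        for col_label, n in cells.items():
            merged[new_label][col_label] += n
    return {row: dict(cells) for row, cells in merged.items()}


recoded = recode_rows(counts, recode)
for party, cells in recoded.items():
    print(party, cells)
```

Because the mapping is written out explicitly, the combination is a visible analytical decision rather than something done automatically to the smallest rows.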

  14. Final analytical matter • How much of a difference is enough to be meaningful? An important question. For now, see the Weisberg et al. reading, pp. 211-12. You might want to look at this when doing your data assignment.

  15. Presentation matters • DO NOT USE SPSS OUTPUT DIRECTLY. • Reformat as necessary. • Provide meaningful labels. • Give it a title. • Show n’s and %s, not cell counts. • This can easily be done in MS Word (maybe other ways as well).
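
If you would rather build the table programmatically than retype it, the checklist above can be sketched roughly as follows. The counts are invented, `format_table` is a hypothetical helper, and the layout is just one plain-text possibility: a title, meaningful labels, whole-number percentages within each party, and an N for each column instead of raw cell counts.

```python
# Hypothetical counts, organized as counts[party][career]; not the class data.
counts = {
    "Democrat": {"Law": 12, "Politics": 9, "Business": 6, "Academia": 8},
    "Republican": {"Law": 10, "Politics": 5, "Business": 11, "Academia": 3},
}
careers = ["Law", "Politics", "Business", "Academia"]
title = "Table 1. Career Interests by Party Identification"


def format_table(title, counts, dv_labels):
    """Return a plain-text table: percentages within each party, plus N's."""
    parties = list(counts)
    totals = {p: sum(counts[p].values()) for p in parties}

    lines = [title, ""]
    lines.append(f"{'Career interest':<18}" + "".join(f"{p:>12}" for p in parties))
    for dv in dv_labels:
        cells = ""
        for p in parties:
            pct = round(100 * counts[p][dv] / totals[p])  # whole numbers only
            cells += f"{pct:>11}%"
        lines.append(f"{dv:<18}" + cells)

    # Report the number of cases per column rather than every cell count.
    n_cells = ""
    for p in parties:
        n_label = f"({totals[p]})"
        n_cells += f"{n_label:>12}"
    lines.append(f"{'(N)':<18}" + n_cells)
    return "\n".join(lines)


text = format_table(title, counts, careers)
print(text)
```

Rounded percentages may add to 99 or 101 in a column; that is normal and not worth extra decimal places.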

  16. Table 1. Career Interests by Party Identification

  17. Table 1. Career Interests by Party Identification

  18. Data Analysis #1: due one week from today (by individuals, not pairs) • Directions are on the syllabus. • Describe a hypothesis (NES or GSS). Tell us why this hypothesis makes sense. Thoughtfulness is rewarded. (“Democrats more often voted for Gore” is not that thoughtful.) • Discuss the operationalization of your concepts. Tell us how you operationalized your variables, but also why. Tell us about measurement problems.

  19. Generate an SPSS cross-tab to test your hypothesis. Percentage the table properly. Presentation as noted earlier. • Explain the table: is your hypothesis supported? (More than “yes” or “no” is required.) Note possible alternative explanations.

  20. Usually ≤ 3 pp. (double-spaced) + table. The table should go on a separate page. • Writing is important. Use clear, straightforward prose (you are not writing a novel). Use proper grammar; correct spelling, punctuation, and capitalization; and make it typo-free.
