1 / 11

Thinking about Data:

Thinking about Data:. Terms: matrix unit of analysis case variable code. Types of Data.

luke-ware
Download Presentation

Thinking about Data:

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Thinking about Data: • Terms: • matrix • unit of analysis • case • variable • code

  2. Types of Data • Microlevel: data collected on the characteristics of individual cases, people, houses, events, that is, discrete units. For example an individual, with characteristic information on sex, age, state of residence, etc. • Aggregate: Tabular data representing counts of units falling into particular categories, e.g., populations of states. The state is the unit of analysis; the variables are the name of the state and the population of the state.

  3. Sources of Data • Survey: collected specifically for the research purpose, e.g., CPS, GSS, census. • Administrative record: records of immigrant arrivals by port; tax filings; vital registration records; case files of judicial proceedings, health records.

  4. Univariate Statistics • Types of Variables: Nominal; ordinal, interval, ratio • Measures of central tendency: mean, median, mode • Measures of dispersion: standard deviation, ntiles, range, coefficient of variation • Measures of shape: skewness, kurtosis.

  5. YRBUILT N of cases 1235 Minimum 888.000 Maximum 929.000 Range 41.000 Sum 1116660.000 Median 904.000 Mean 904.178 95% CI Upper 904.724 95% CI Lower 903.633 Std. Error 0.278 Standard Dev 9.770 Variance 95.451 C.V. 0.011 Skewness(G1) 0.409 SE Skewness 0.070 Kurtosis(G2) -0.528 SE Kurtosis 0.139 CONCOST 1127 20.000 5200.000 5180.000 354277.000 250.000 314.354 332.888 295.820 9.446 317.116 100562.250 1.009 5.563 0.073 59.859 0.146 STATS YRBUILT CONCOST / Mean Min Max SD CV Kurtosis Median Range SEK SEM SES Skewness Sum Variance N CIM=.95

More Related