1 / 14

Lecture 15:

Lecture 15: . Univariate Descriptive Statistics. Agenda. Finding Data for Quantitative Analysis Using Descriptive Statistics. Finding Data for Quantitative Analysis. Berkeley Survey Research Center http://srcweb.berkeley.edu/ UC Data (access to ICPSR) http://ucdata.berkeley.edu

reed
Download Presentation

Lecture 15:

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Lecture 15: Univariate Descriptive Statistics

  2. Agenda • Finding Data for Quantitative Analysis • Using Descriptive Statistics

  3. Finding Data for Quantitative Analysis • Berkeley Survey Research Center • http://srcweb.berkeley.edu/ • UC Data (access to ICPSR) • http://ucdata.berkeley.edu • Data on the Net • http://3stages.org/idata/

  4. Thinking about Data Sources for Final Exam

  5. Probability and Statistics • Statistics deal with what we observe and how it compares to what might be expected by chance. • A set of probabilities corresponding to each possible value of some variable, X, creates a probability distribution • For now, we especially care about the normal (Gaussian) distribution

  6. The Normal Curve

  7. Describing Simple Distributions of Data • Central Tendency • Some way of “typifying” a distribution of values, scores, etc. • Mean (sum of scores divided by number of scores) • Median (middle score, as found by rank) • Mode (most common value from set of values) • In a normal distribution, all 3 measures are equal. • Example: GSS data

  8. Special Features of the Mean • Sum of the deviations from the mean of all scores = zero. • Unlike the median, it is the point in a distribution where the two halves are balanced.

  9. Using Central Tendencies in Recoding • “splitting” metrics into binary variables • “collapsing” variables

  10. Dispersion • Range • Difference between highest value and the lowest value. • Standard Deviation • A statistic that describes how tightly the values are clustered around the mean. • Variance • A measure of how much spread a distribution has. • Computed as the average squared deviation of each value from its mean

  11. Properties of Standard Deviation • Variance is just the square of the S.D. (or, S.D is the square root of the variance) • If a constant is added to all scores, it has no impact on S.D. • If a constant is multiplied to all scores, it will affect the dispersion (S.D. and variance) S = standard deviationX = individual scoreM = mean of all scoresn = sample size (number of scores)

  12. Why Variance Matters… • In many ways, this is the purpose of many statistical tests: explaining the variance in a dependent variable through one or more independent variables.

  13. Common Data Representations • Histograms • Simple graphs of the frequency of groups of scores. • Stem-and-Leaf Displays • Another way of displaying dispersion, particularly useful when you do not have large amounts of data. • Box Plots • Yet another way of displaying dispersion. Boxes show 75th and 25th percentile range, line within box shows median, and “whiskers” show the range of values (min and max)

More Related