220 likes | 231 Views
Education 793 Class Notes. Descriptive Statistics, Central Tendency and Variability 10 September 2003. Today’s agenda. Lab details finalized Your announcements Chapter 3: Frequency Distributions Chapter 4: Central Tendency and Variability. Data Matrix.
E N D
Education 793 Class Notes Descriptive Statistics, Central Tendency and Variability 10 September 2003
Today’s agenda • Lab details finalized • Your announcements • Chapter 3: Frequency Distributions • Chapter 4: Central Tendency and Variability
Data Matrix Data Matrix is defined as a matrix in which subjects are listed in the rows and variables corresponding to each subject are listed in the columns Example
Basic Terms and Concepts • Frequency Distribution: Orders values from lowest to highest and gives the number or percent of subjects with each value. • Can be presented with: • tabular form • histogram • frequency polygon • stem and leaf plot
Shape of Frequency Distributions • Symmetric: when two halves of the distribution mirror each other
Rectangular Bimodal Unimodal Multimodal Modality • Modality: the number of relative peaks the distribution exhibits
Skew A. B. Left (Negative) Skew Right (Positive) Skew
Kurtosis (Peakedness) Flat Peak, No Tails High Peak, Long Tails
Central Tendency and Variability 1) Central tendency Mode Median Mean 2) Variation Range Semi-interquartile Range Variance Standard deviation
Central Tendency Mode Most frequently occurring score in a distribution Median Point on the distribution below which one-half (50%) of the scores fall Mean Arithmetic average of scores within a distribution Special properties: Sum of deviations of scores mean is zero Sum of squares of deviations is at a minimum
Mean, Median, Mode A distribution of GRE scores: 340 450 510 580 580 600 620 660 670 710 Mode: Median: Mean: 572 A second distribution of GRE scores: 340 450 510 580 580 600 620 660 670 1710* Mode: Median: Mean: 672 *Heidi’s score
Shapes and Statistics A. B.
Variability Identifying the middle of a distribution as revealed by measures of central tendency is of limited value unless one also knows how much the scores in the distribution differ from each other. As such, measures of how much scores differ from each other should always accompany measures of central tendency. These measure of how much the scores differ from each other are called measures of "spread“ or variability.
Basic Measures • Range The range is the simplest measure of spread: It is equal to the difference between the largest and the smallest values. The range can be a useful measure of spread primarily because it is so easily understood, but it is otherwise seldom used in real statistical practice. It is very sensitive to extreme scores, however, because it is based on only two values. • The semi-interquartile range Computed as one half the difference between the 75th percentile (often called Q3) and the 25th percentile (Q1), or (Q3 - Q1) / 2. Because half the scores in a distribution lie between Q3 and Q1, semi-interquartile range is 1/2 the distance needed to cover 1/2 the scores. In a symmetric distribution, an interval stretching from one semi-interquartile range below the median to one semi- interquartile above the median will contain 1/2 of the scores. This will not be true for a skewed distribution, however.
Variance and Standard Deviation The variance is a widely used measure of spread. It is computed as the average squared deviation of each number from its mean. The formula (in summation notation) for the variance in a population in which M is the mean and N is the number of scores is: The standard deviation is the square root of the variance. It is the most commonly used measure of spread. An important attribute of the standard deviation as a measure of spread is that it is possible to compute the percentile rank associated with any given score if the mean and standard deviation of a normal distribution are known.
Calculating Standard Deviation Or to calculate by hand there is a shortcut formula
Numbers =13.5
Next Week • Coursepack • Say it with Figures Chapter7: • The Cross-Tabulation Refines • and Available through JSTOR at www.lib.umich.edu