260 likes | 433 Views
Engineering Applications of Statistics (will be known to us eventually as we go through our syllabus). Statistical Process Control Quality Assessment Model Building and Predicting Communicating with and Acting on Experimental Results Assessing Design Reliability Experimental Design.
E N D
Engineering Applications of Statistics (will be known to us eventually as we go through our syllabus) • Statistical Process Control • Quality Assessment • Model Building and Predicting • Communicating with and Acting on Experimental Results • Assessing Design Reliability • Experimental Design
VARIABLE AND DATA • Variable • A characteristic of a population or sample that is of interest to us. • Example • In case of Hudson Auto a variable is the average cost of parts used in engine tune-ups.
91 78 93 57 75 52 99 80 97 62 71 69 72 89 66 75 79 75 72 76 104 74 62 68 97 105 77 65 80 109 85 97 88 68 83 68 71 69 67 74 62 82 98 101 79 105 79 69 62 73 VARIABLE AND DATA • Data • The actual values of the variables • Data may be quantitative, qualitative • Example • In case of Hudson Auto data are the actual costs of parts used in 50 engine tune-ups observed:
QUANTITATIVE DATA • Quantitative data may have a ratio scale • Data possessing a natural zero point and organized into measures for which differences are meaningful. • Examples: Money, income, sales, profits, losses, heights of NBA players • Quantitative data may also have an interval scale • The distance between numbers is a known, constant size, but the zero value is arbitrary. • Examples: Temperature on the Fahrenheit scale.
Types of Quantitative Data A) Nominal data: They are numbers that represent arbitrary data. Example: An engineering school, might use numbers to denote undergraduate majors. 1 For electrical engineering 2 For civil engineering and so on.
Types of Quantitative Data B) Ordinal data convey ranking in terms of importance, strength, or severity. Example: a value of 3 corresponds to gentle breeze A value of 6 corresponds to a strong breeze A value of 9 signifies a strong gale The change in force between a gentle and a strong sea breeze is not equal to that between a strong gale and a strong breeze, as the numbers themselves would misleadingly indicate.
Types of Quantitative Data C) Interval data allow only addition and subtraction. Example: Temperature for which scales are arbitrarily chosen. A 100 degree Fahrenheit day is not twice as hot as 50 degree day since 100 degree/ 50 degree is not a meaningful ratio.
Types of Quantitative Data D) Ratio Data include times and many physical measurements of size, weight, or strength. The arithmetic operations of addition, subtraction, division, and multiplication are all valid with ratio data. Most statistical investigations involve arithmetic operations, which limits them to interval or ratio data. Example: income, sales, profits.
QUALITATIVE DATA • Qualitative data has a nominal scale • Data that can only be classified into categories and cannot be arranged in an ordering scheme. • Examples: eye color, gender, marital status, religious affiliation, etc. • Examples: Suppose that the responses to the marital-status question is recorded as follows: Single 1 Divorced 3 Married 2 Widowed 4
85:222: Chapter 2Describing, Displaying, and Exploring Statistical Data Assuming you have collected a data set of your interest The questions are: How can you make sense out of it? How can you organize and summarize the data set to make it more comprehensible and meaningful?
Graphical techniques for Describing Quantitative DataThe Frequency Distribution • The most popular and traditional graphical method for describing quantitative data is the frequency histogram– often called a frequency distribution.
HISTOGRAM Consider the following data that shows days to maturity for 40 short-term investments
HISTOGRAM • First, construct a frequency distribution • An arrangement or table that groups data into non-overlapping intervals called classes and records the number of observations in each class • Approximate number of classes: See Table 2.3, p. 29 Number of observation Number of classes Less than 50 5-7 50-200 7-9 200-500 9-10 500-1,000 10-11 1,000-5,000 11-13 5,000-50,000 13-17 More than 50,000 17-20
HISTOGRAM • Approximate class width is obtained as follows:
HISTOGRAM Classes and counts for the days-to-maturity data
HISTOGRAM • Class relative frequency is obtained as follows:
HISTOGRAM • Lower cut point: The smallest value that can go in a class. • Upper cut point: The smallest value that can go in the next higher class. The upper cut point of a class is the same as the lower cut point of the next higher class. • Midpoint: The middle of a class, obtained by taking the average of its lower and upper cut points. • Width: The difference between the upper and lower cut points of a class.
RELATIVE FREQUENCY HISTOGRAM Relative-frequency distribution for the days-to-maturity data
FREQUENCY POLYGON • A frequency polygon is a graph that displays the data by using lines that connect points plotted for frequencies at the midpoint of classes. • The frequencies represent the heights of the midpoints.
STEM-AND-LEAF DISPLAY • Here the raw data are arranged tabularly by locating each observation on a ‘tree’ This is done by separating the values into a stem digit and a leaf digit. • The main advantage is that it provides the essential features of the histogram; it can be seen easily by rotating the plot 90 degrees • The disadvantage is that the stem-and-leaf plot becomes cumbersome if the number of observations is large
STEM-AND-LEAF DISPLAY Diagrams for days-to-maturity data: (a) stem-and-leaf Stem Leaves 3 4 5 6 7 8 9 (a)