MBA Statistics 51-651-00. http://www.hec.ca/sites/cours/51-651-02/. What is statistics?.
MBA Statistics 51-651-00 http://www.hec.ca/sites/cours/51-651-02/
What is statistics? "I like to think of statistics as the science of learning from data... Statistics is essential for the proper running of government, central to decision making in industry, and a core component of modern educational curricula at all levels." Jon Kettenring, ASA President, 1997
What is statistics? • The American Heritage Dictionary® defines statistics as: "The mathematics of the collection, organization, and interpretation of numerical data, especially the analysis of population characteristics by inference from sampling." • The Merriam-Webster’s Collegiate Dictionary® definition is: "A branch of mathematics dealing with the collection, analysis, interpretation, and presentation of masses of numerical data."
Course syllabus • Variation. Sampling and estimation. • Decision making from statistical inference. • Qualitative data analysis. • Simple and multiple linear regression. • Forecasting. • Statistical process control. • Revision.
EVALUATION • Teamwork: 40% • Final exam: 60%
COURSE # 1 Variation, sampling and estimation.
Variation "The central problem in management and in leadership ... is failure to understand the information in variation" W. Edwards Deming
Variation "Management takes a major step forward when they stop asking you to explain random variation" F. Timothy Fuller
Variation "Failure to understand variation is a central problem of management" Lloyd S Nelson
Airport Immigration • Management expected their officers to process 10 passengers during this period • The immigration services manager, in reviewing these figures, was • concerned about the performance of Colin • thinking how best to reward Frank
Debt Recovery • When the amount of recovered debt is much lower than the target recovery level of 80%, the General Manager visits all the District Offices in New Zealand to remind managers of the importance of customers paying on time • What do you think of the GM policy? • What would you do?
Budget Deviations • Budget deviations measure the difference between the amount budgeted and the actual amount, expressed as a percentage of the budgeted amount. • The aim is to have a zero deviation. • Most of the variation lies between -3% and 4%.
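The definition above can be sketched in Python as follows. The sign convention (actual minus budgeted, so a positive value means overspending) and the function name are assumptions; the slide only states that the difference is expressed as a percentage of the budgeted amount.

```python
def budget_deviation_pct(budgeted: float, actual: float) -> float:
    """Deviation as a percentage of the budgeted amount.

    Assumed sign convention: positive means the actual amount
    exceeded the budget, negative means it fell short.
    """
    if budgeted == 0:
        raise ValueError("budgeted amount must be non-zero")
    return 100.0 * (actual - budgeted) / budgeted


# Hypothetical figures, for illustration only.
print(budget_deviation_pct(budgeted=200.0, actual=208.0))
```

Under this convention a deviation of zero means the budget was met exactly.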
Illustration of variation Excel program: beads.xls (Deming) The red beads are associated with defective products. Five times a day, 5 technicians each select a sample of 50 beads and count the number of defectives (red).
Discussion • What is the main difference between the graph of the 25x27=675 draws and the graph of the 27 averages?
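For readers without the Excel file, the bead experiment can be approximated with a short simulation. This is only a sketch: the proportion of red beads (`p_red = 0.2`), the reading of the 27 averages as daily averages over 25 draws, and all names are assumptions, not values taken from beads.xls.

```python
import random
import statistics


def simulate_beads(n_days=27, draws_per_day=25, sample_size=50,
                   p_red=0.2, seed=0):
    """Simulate red-bead counts per draw and the average count per day.

    p_red is an assumed proportion of red (defective) beads.
    """
    rng = random.Random(seed)
    draws, daily_means = [], []
    for _ in range(n_days):
        # Each draw counts the red beads in a sample of `sample_size` beads.
        day = [sum(rng.random() < p_red for _ in range(sample_size))
               for _ in range(draws_per_day)]
        draws.extend(day)
        daily_means.append(statistics.mean(day))
    return draws, daily_means


draws, daily_means = simulate_beads()
print("SD of the individual draws:", round(statistics.stdev(draws), 2))
print("SD of the daily averages:  ", round(statistics.stdev(daily_means), 2))
```

Comparing the two standard deviations shows how much less the averages fluctuate than the individual counts, even though the underlying process never changes.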
Two approaches in management • Fire-fighting • Scientific
Fire-fighting approach: Problem → Solution
Scientific approach: Problem → Cause → Solution
Scientific Approach • Making decisions based on data rather than hunches. • Looking for the root causes of problems rather than reacting to superficial symptoms. • Seeking permanent solutions rather than quick fixes.
The Need for Data • To understand the process • To determine priorities • To establish relationships • To monitor the process • To eliminate causes of variation
The steps of statistical analysis involve: • Planning the collection of information • Collecting information • Evaluating information • Drawing conclusions
Surveys: • Collect information from a carefully specified sample and extend the results to an entire population. • Sample surveys might be used to: • Determine which political candidate is more popular • Discover what foods teenagers prefer for breakfast • Estimate the number of potential clients
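Sampling definitions • From the population, we choose a sample. • From the sample, we calculate a statistic. • The statistic is used to estimate the corresponding population parameter.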
Government Operations: • Conduct experiments to aid in the development of public policy and social programs. • Such experiments include: • consumer prices; • fluctuations in the economy; • employment patterns; • population trends.
Scientific Research: • Statistical sciences are used to enhance the validity of inferences in: • radiocarbon dating to estimate the risk of earthquakes; • clinical trials to investigate the effectiveness of new treatments; • field experiments to evaluate irrigation methods; • measurements of water quality; • psychological tests to study how we reach the everyday decisions in our lives.
Business and Industry: • predict the demand for products and services; • check the quality of items manufactured in a facility; • manage investment portfolios; • forecast how much risk activities entail, and calculate fair and competitive insurance rates.
Sampling • Our knowledge, our attitudes and our actions are mainly based on samples. • For example, a person’s opinion of an institution or a company which makes thousands of transactions every day is often determined by only one or two meetings with this institution.
Census vs Sample • Census = reality (True or false?!) • The information needed is available for all individuals of the study population. • Sample = estimation of the reality • The information needed is only available for a subset of the individuals of the study population.
Advantages of a sample • Reduced costs • Greater speed • More possibilities: in some cases it may be impossible to conduct a census (e.g. quality control) • Possibly greater precision, for example when highly qualified personnel are necessary for collecting data
Sampling errors • Random error • different samples will produce different estimates of the study population characteristics • Systematic error (bias) • non-probabilistic sample • probabilistic sample with a high rate of non-respondents • biased measurement instrument
TV Show Poll - March 1998 • Should Hamilton be renamed Waikato City? • 4400 dialled the 0900 number • 73% were against the change • What type of sample was taken? • What conclusions would you draw?
Bias vs variability • Bias is a systematic error, in the same direction, in successive estimates of a parameter. • Large variability means that repeated estimates are scattered; the results of successive sampling cannot be reproduced. • (see …)
a) large bias, low variability b) low bias, high variability c) large bias, high variability d) low bias, low variability
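The two notions can be separated numerically. In this minimal sketch, bias is measured as the average estimate minus the true value, and variability as the standard deviation of the estimates. The function name and the repeated estimates are hypothetical, chosen only to mimic cases a) and b).

```python
import statistics


def bias_and_variability(estimates, true_value):
    """Return (bias, spread) for repeated estimates of one parameter."""
    bias = statistics.mean(estimates) - true_value   # systematic error
    spread = statistics.stdev(estimates)             # scatter of the estimates
    return bias, spread


# Hypothetical repeated estimates of a parameter whose true value is 50.
tight_but_off = [59, 60, 61, 60]    # resembles a) large bias, low variability
centred_but_wide = [40, 62, 45, 53] # resembles b) low bias, high variability

print(bias_and_variability(tight_but_off, 50))
print(bias_and_variability(centred_but_wide, 50))
```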
Bias due to non-response • Bias is often caused by non-response in surveys. • For example, suppose that the population is divided into two groups: respondents (60%) and non-respondents (40%). • Among respondents, 65% are in favour of a project, and among non-respondents, 20% are in favour. • The real proportion of the population in favour of the project is p = 0.60 × 65% + 0.40 × 20% = 47%, while a survey will give an estimate of p of about 65% instead of 47%. The bias is 18 percentage points.
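The arithmetic of this example can be checked with a few lines of Python. The function and variable names are illustrative; the sketch assumes the survey only ever observes respondents, so its estimate equals the proportion in favour among respondents.

```python
def nonresponse_bias(resp_share, p_resp, p_nonresp):
    """Return (true proportion, survey estimate, bias) for a two-group population."""
    # The true proportion is the weighted average over both groups.
    true_p = resp_share * p_resp + (1 - resp_share) * p_nonresp
    # Assumption: the survey sees respondents only.
    survey_p = p_resp
    return true_p, survey_p, survey_p - true_p


true_p, survey_p, bias = nonresponse_bias(0.60, 0.65, 0.20)
print(f"true p = {true_p:.2f}, survey estimate = {survey_p:.2f}, bias = {bias:.2f}")
# prints: true p = 0.47, survey estimate = 0.65, bias = 0.18
```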
How do we draw a simple random sample? • We need a list. Each element of the population is assigned a number from 1 to N. • We use a computer program to select n numbers as randomly as possible (e.g. Excel, MINITAB, SAS, C). • The corresponding elements form the sample.
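The same three steps can be sketched in Python, used here in place of the programs named on the slide. The list of families and the function name are hypothetical; `random.Random.sample` draws without replacement, which is what a simple random sample requires.

```python
import random


def simple_random_sample(population, n, seed=None):
    """Draw a simple random sample of size n from a list."""
    rng = random.Random(seed)
    N = len(population)
    # Steps 1 and 2: number the elements 1..N and pick n distinct numbers.
    numbers = rng.sample(range(1, N + 1), n)
    # Step 3: the corresponding elements form the sample.
    return [population[i - 1] for i in numbers]


# Hypothetical sampling frame of 1000 families.
families = [f"family_{i}" for i in range(1, 1001)]
sample = simple_random_sample(families, 100, seed=42)
print(sample[:5])
```

Fixing the seed makes the draw reproducible; leaving it as `None` gives a different sample on each run.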
Notes: • The results obtained depend on the sample taken. • If the samples are taken according to codes of practice, the results should all be similar. • For a simple random draw, each individual of the population is equally likely to be selected at each draw. • For a simple random draw, there are many different possible samples. All possible samples of the same size have the same chance of being selected.
Opinion polls • The results obtained from a probabilistic sample will be used to generalize to the entire population. • But using a sample necessarily introduces a margin of error, which we will try to control. • We will distinguish two types of data: qualitative and quantitative.
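As a rough illustration of how the margin of error is controlled by the sample size, here is a sketch for an estimated proportion. It assumes a simple random sample from a large population, the usual normal approximation, and a 95% confidence level (z = 1.96); none of these choices is stated on the slide.

```python
import math


def margin_of_error_proportion(p_hat, n, z=1.96):
    """Approximate margin of error for a sample proportion.

    Assumes a simple random sample and the normal approximation;
    z = 1.96 corresponds to a 95% confidence level.
    """
    if not 0 <= p_hat <= 1 or n <= 0:
        raise ValueError("need 0 <= p_hat <= 1 and n > 0")
    return z * math.sqrt(p_hat * (1 - p_hat) / n)


# Hypothetical poll sizes with an observed proportion of 50%.
for n in (100, 400, 1000):
    print(n, round(margin_of_error_proportion(0.5, n), 3))
```

Quadrupling the sample size halves the margin of error, which is why precision becomes expensive quickly.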
Types of data • Qualitative (measurement scale: nominal or ordinal) (parameter: %) Examples: • sex (F, M) • political party (PLQ, PQ, ADQ) • preferred brand (Coke, Pepsi, Homemade brand, …) • satisfaction level (Likert scale from 1 to 5) • Quantitative (measurement scale: interval or ratio) (parameter: mean) Examples: • age • income • temperature (in degrees Celsius)
Case study • Data in credit.xls represent the credit balance and the total income of 100 randomly chosen families in Quebec. • What is the mean credit balance for a family in Quebec? What is the precision (margin of error) of your estimate? • What about a Canadian family? • Assuming that 2 500 000 families use at least one credit card regularly, what is the total debt of families in Quebec? What is the precision of the estimate?
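One possible way to organize the calculation is sketched below. The contents of credit.xls are not reproduced here, so the balances are placeholder values for illustration only; the 95% level (z = 1.96), the normal approximation, and the function names are assumptions. The finite population correction is ignored, which is reasonable when 100 families are drawn from 2 500 000.

```python
import math
import statistics


def mean_with_margin(values, z=1.96):
    """Sample mean and its approximate margin of error (z * s / sqrt(n))."""
    n = len(values)
    mean = statistics.mean(values)
    margin = z * statistics.stdev(values) / math.sqrt(n)
    return mean, margin


def total_with_margin(values, population_size, z=1.96):
    """Estimated population total and margin: N times the mean and its margin."""
    mean, margin = mean_with_margin(values, z)
    return population_size * mean, population_size * margin


# Placeholder balances, NOT the data from credit.xls.
balances = [1000, 2000, 3000, 4000]

mean, margin = mean_with_margin(balances)
total, total_margin = total_with_margin(balances, 2_500_000)
print(f"mean balance: {mean:.0f} +/- {margin:.0f}")
print(f"total debt:   {total:.0f} +/- {total_margin:.0f}")
```

With the real data, the 100 credit balances from credit.xls would replace the placeholder list. Note that these estimates apply to Quebec families only, since that is the population the sample was drawn from.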