CADA Final Review
Assessment
• Continuous assessment (10%)
• Mini-project (20%)
• Mid-test (20%)
• Final Examination (50%): 40% from Parts 1 & 2, 60% from Parts 3 & 4
Main contents
• Getting Started with SPSS
• Describing Data
• Testing Hypotheses
• Examining Relationships
Part 1: Getting Started with SPSS
Try to open the SPSS data file demo.sav. SPSS example files can be found in C:\Program Files\SPSSInc\Statistics17\Samples\English. This data file is a fictitious survey of several thousand people, containing basic demographic and consumer information. In Data View, columns represent variables, and rows represent cases (observations).
Construct an SPSS data file
1. By entering data directly
2. By reading from other applications
In Variable View, each row is a variable, and each column is an attribute that is associated with that variable.
Summary of Types of Variables
Data
• Categorical data: nominal or ordinal (e.g., type of car owned)
• Scale (quantitative) data: discrete (e.g., number of children) or continuous (e.g., time of an exam)
A simple frequency table • The “missing” item tells us how many people did not select one of the two valid answers.
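To make the role of the "missing" row concrete, here is a minimal Python sketch of a frequency table. It is not SPSS output: the function name `frequency_table`, the sample answers, and the use of `None` for a missing answer are all assumptions made for illustration.

```python
from collections import Counter

def frequency_table(responses, valid_values):
    """Count each valid answer; anything else is counted as 'Missing'."""
    counts = Counter(r if r in valid_values else "Missing" for r in responses)
    total = len(responses)
    valid_total = total - counts.get("Missing", 0)
    rows = []
    for value in list(valid_values) + ["Missing"]:
        n = counts.get(value, 0)
        percent = 100.0 * n / total
        # Valid percent leaves the missing cases out of the denominator.
        valid_percent = None if value == "Missing" else 100.0 * n / valid_total
        rows.append((value, n, percent, valid_percent))
    return rows

# Hypothetical answers; None stands for a respondent with no valid answer.
answers = ["Yes", "No", "Yes", None, "Yes", "No", None, "Yes"]
for row in frequency_table(answers, ["Yes", "No"]):
    print(row)
```

The percent column uses all cases as the denominator, while the valid percent column uses only the cases with a valid answer.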
Histogram
A histogram is a chart for grouped numerical data in which the frequency or percentage of each group is represented as an individual bar.
Stem-and-leaf plots
Completion time in hours: Stem-and-Leaf Plot for agecat6 = 45-49

Frequency    Stem &  Leaf
  2.00        2 .  99
 13.00        3 .  0022223344444
 40.00        3 .  555566777777788888888899999999999999999
 35.00        4 .  00000001111111122222233333333334444
 21.00        4 .  555666666777778888899
 12.00        5 .  000111111234
  9.00        5 .  667778889
  4.00        6 .  0011
  4.00 Extremes    (>=6.2)

Stem width: 1.00
Each leaf: 1 case(s)
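The idea behind a stem-and-leaf plot can be sketched in a few lines of Python. This is a simplified illustration, not the SPSS procedure: it assumes non-negative values, takes the first digit after the stem as the leaf, and does not split each stem into two rows or flag extremes as the SPSS plot does. The function name and the sample values are hypothetical.

```python
from collections import defaultdict

def stem_and_leaf(values, stem_width=1.0):
    """Return {stem: sorted list of leaf digits} for non-negative values."""
    plot = defaultdict(list)
    for v in values:
        # Express the value in tenths of a stem; the tiny offset guards
        # against floating-point results such as 28.999999999.
        tenths = int(v / stem_width * 10 + 1e-9)
        stem, leaf = divmod(tenths, 10)
        plot[stem].append(leaf)
    return {stem: sorted(leaves) for stem, leaves in sorted(plot.items())}

# Hypothetical completion times in hours.
times = [2.9, 2.9, 3.0, 3.2, 3.5, 4.1, 4.1, 5.6]
for stem, leaves in stem_and_leaf(times).items():
    # Frequency, stem, then one digit per case.
    print(f"{len(leaves):5.2f}  {stem} . {''.join(str(d) for d in leaves)}")
```

Each printed row shows how many cases share a stem and lists one leaf digit per case, which is how the frequency column and the leaves in the plot relate.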
Testing the Relationship between Scale & Categorical Variables: Compare Means
Age, Education, and Internet Use • Internet use by age (statistics for subgroups)
ANOVA Table • The F test shows that there is a significant difference in average hours worked per week among the five categories of education.
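As a rough illustration of where the F value in an ANOVA table comes from, the sketch below computes the one-way F statistic as the between-groups mean square divided by the within-groups mean square. The function name and the three small groups are made up for the example; the p-value, which SPSS reports alongside F, is not computed here.

```python
def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of numeric groups."""
    all_values = [x for g in groups for x in g]
    n_total = len(all_values)
    k = len(groups)
    grand_mean = sum(all_values) / n_total
    means = [sum(g) / len(g) for g in groups]

    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    # Variation of individual values around their own group mean.
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)

    df_between = k - 1
    df_within = n_total - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Hypothetical hours worked per week in three education categories.
groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
f_stat, df_b, df_w = one_way_anova_f(groups)
print(f_stat, df_b, df_w)
```

A large F means the group means differ by more than the spread inside the groups would suggest.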
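Testing a single mean
• The standard error of the mean is $SE_{\bar{x}} = s / \sqrt{n}$, where $s$ is the sample standard deviation and $n$ is the sample size.
• The t-statistic is $t = (\bar{x} - \mu_0) / SE_{\bar{x}}$ with $n - 1$ degrees of freedom, where $\mu_0$ is the hypothesized mean.
• The 95% confidence interval of the difference is $(\bar{x} - \mu_0) \pm t_{0.025,\,n-1} \cdot SE_{\bar{x}}$.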
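The three quantities for a one-sample t test can be worked through by hand. The sketch below uses the standard definitions (standard error = s/√n, t = (mean − hypothesized mean)/standard error, interval = difference ± critical t × standard error). The sample, the hypothesized mean of 4, and the function name are hypothetical; the critical value 2.365 is the two-sided 5% point of the t distribution with 7 degrees of freedom and is passed in by hand because the Python standard library has no t distribution.

```python
import math
import statistics

def one_sample_t(sample, mu0, t_crit):
    """Return (standard error, t statistic, CI of the difference mean - mu0)."""
    n = len(sample)
    mean = statistics.mean(sample)
    # statistics.stdev uses the n - 1 denominator (sample standard deviation).
    se = statistics.stdev(sample) / math.sqrt(n)
    diff = mean - mu0
    t_stat = diff / se
    ci = (diff - t_crit * se, diff + t_crit * se)
    return se, t_stat, ci

# Hypothetical sample of 8 cases, tested against a hypothesized mean of 4.
sample = [2, 4, 4, 4, 5, 5, 7, 9]
se, t_stat, ci = one_sample_t(sample, mu0=4, t_crit=2.365)
print(se, t_stat, ci)
```

If the interval for the difference contains 0, the sample mean is not significantly different from the hypothesized value at the 5% level.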
Testing a Hypothesis about Two Related Means • The paired-samples t test is recommended for this problem.
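A paired-samples t test is a one-sample t test on the within-pair differences, with the hypothesized mean difference set to 0. The sketch below shows that calculation; the function name and the before/after scores are made up for illustration, and the p-value is left out.

```python
import math
import statistics

def paired_t(first, second):
    """Return (t statistic, degrees of freedom) for paired measurements."""
    if len(first) != len(second):
        raise ValueError("paired samples must have the same length")
    # One difference per case: second measurement minus first.
    diffs = [b - a for a, b in zip(first, second)]
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    se = statistics.stdev(diffs) / math.sqrt(n)
    return mean_diff / se, n - 1

# Hypothetical scores for the same four cases measured twice.
before = [10, 12, 9, 11]
after = [12, 15, 10, 14]
t_stat, df = paired_t(before, after)
print(t_stat, df)
```

Working on the differences removes the variation between cases, which is why the paired test suits two related means.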
Testing Two Independent Means • Output from t test for TV watching hours
Linear regression model • The regression model becomes • life expectancy = 90 − (0.70 × birthrate) This tells us that for an increase of 1 in birthrate, predicted life expectancy decreases by 0.70 years.
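The fitted equation can be used directly for prediction. The sketch below plugs the intercept (90) and slope (−0.70) from the slide into a small function; the function name and the birthrate values 20 and 21 are hypothetical and are only there to show the effect of a one-unit increase.

```python
INTERCEPT = 90.0   # predicted life expectancy when birthrate is 0
SLOPE = -0.70      # change in predicted life expectancy per unit of birthrate

def predict_life_expectancy(birthrate):
    """Apply the fitted line: life expectancy = 90 - 0.70 * birthrate."""
    return INTERCEPT + SLOPE * birthrate

# Hypothetical birthrates one unit apart.
at_20 = predict_life_expectancy(20)
at_21 = predict_life_expectancy(21)
print(at_20, at_21, at_21 - at_20)
```

The difference between the two predictions equals the slope, which is the interpretation given for the coefficient.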
Crosstabulation • Contingency Table by the use of time and education • Here the percentages are column %
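Column percentages divide each cell by its column total, so every column adds up to 100%. A minimal sketch, assuming a table stored as a list of rows; the counts and the function name are invented for illustration.

```python
def column_percentages(table):
    """Convert a table of counts (list of rows) into column percentages."""
    # zip(*table) transposes the rows so each tuple is one column.
    col_totals = [sum(col) for col in zip(*table)]
    return [
        [100.0 * cell / total for cell, total in zip(row, col_totals)]
        for row in table
    ]

# Hypothetical counts: rows are categories of one variable,
# columns are categories of the other.
counts = [
    [10, 30],
    [30, 30],
]
for row in column_percentages(counts):
    print(row)
```

With column percentages, comparisons are made across a row: each column is treated as its own group of 100%.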
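The Chi-Square Test Statistic
The test statistic is: $\chi^2 = \sum_{\text{all cells}} \frac{(f_o - f_e)^2}{f_e}$, where $f_o$ is the observed frequency in a cell and $f_e$ is the frequency expected under independence.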
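The chi-square statistic for a contingency table can be computed step by step: the expected count in each cell is (row total × column total) / grand total, and the statistic sums (observed − expected)² / expected over all cells. The sketch below follows that standard definition; the function name and the 2×2 counts are hypothetical, and the p-value lookup is omitted.

```python
def chi_square_statistic(observed):
    """Return (chi-square statistic, degrees of freedom) for a table of counts."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)

    chi2 = 0.0
    for i, row in enumerate(observed):
        for j, obs in enumerate(row):
            # Expected count if the row and column variables were independent.
            expected = row_totals[i] * col_totals[j] / grand_total
            chi2 += (obs - expected) ** 2 / expected

    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    return chi2, df

# Hypothetical 2 x 2 table of counts.
table = [
    [10, 20],
    [30, 40],
]
chi2, df = chi_square_statistic(table)
print(chi2, df)
```

The further the observed counts are from the expected counts, the larger the statistic and the smaller the p-value.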
Chi-square Test of Independence • Since the p-value = 0.00012 < 0.05, you reject the null hypothesis of independence. • There is strong evidence of a relationship between primary reason for not returning and the hotel.