E N D
Introduction In previous lessons, we have plotted and analyzed data that appears to have a linear relationship. The data points in some data sets were very close to a linear model, while other data sets had points that were farther from the linear model. The strength of the relationship between data that has a linear trend can be analyzed using the correlation coefficient. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Introduction, continued A correlation is a relationship between two events, such as x and y, where a change in one event implies a change in another event. The correlation coefficient, r, is a quantity that allows us to determine how strong this relationship is between two events. It is a value that ranges from –1 to 1; a correlation coefficient of –1 indicates a strong negative correlation, a correlation coefficient of 1 indicates a strong positive correlation, and a correlation coefficient of 0 indicates a very weak or no linear correlation. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Introduction, continued You will use a calculator to calculate the correlation coefficient. Note that a correlation between two events does not imply that changing one event causes a change in the other event—only that a change might have taken place in the other event. This will be explored more later. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Key Concepts A correlation is a relationship between two events, where a change in one event implies a change in another event. Correlation doesn’t mean that a change in the first event caused a change in the other event. The strength of a linear correlation can be measured using a correlation coefficient. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Key Concepts, continued Before determining the correlation coefficient, look at the scatter plot of the data and make an initial assessment of the strength of a linear relationship between the two events. To calculate the correlation coefficient on a graphing calculator, follow the steps on the following slides. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Key Concepts, continued On a TI-83/84: Step 1: Set up the calculator to find correlations. Press [2nd], then [CATALOG] (the “0” key). Scroll down and select DiagnosticOn, then press [ENTER]. (This step only needs to be completed once; the calculator will stay in this mode until changed in this menu.) Step 2: To calculate the correlation coefficient, first enter the data into a list. Press [2nd], then L1 (the “1” key). Scroll to enter data sets. Press [2nd], then L2 (the “2” key). Enter the second event in L2. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Key Concepts, continued Step 3: Calculate the correlation coefficient. Press [STAT], then select CALC at the top of the screen. Scroll down to 8:LinReg(a+bx), and press [ENTER]. The r value (the correlation coefficient) is displayed along with the equation. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Key Concepts, continued On a TI-Nspire: Step 1: Go to the lists and spreadsheet page. The icon looks like a table. Step 2: Enter the data into the first column underneath the shaded row, pressing [enter] after each data value. Step 3: Use the nav pad to arrow up to the first row below the shaded row and then arrow over to the right so that you are in the second column. Enter the data values, pressing [enter] after each data value. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Key Concepts, continued Step 4: Press the [menu] key. Step 5: Arrow down to 4: Statistics and press the center click key. Step 6: Press the center click key again to select 1: Stat Calculations. Step 7: Choose 3: Linear Regression (mx+b). Step 8: At the X List field, press [clear] and then type in “a[]”. To type “[]”, press the [ctrl] key and then the [(] key. Step 9: Press [tab] to go the Y List field and type in “b[]”. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Key Concepts, continued Step 10: Press [tab] to go the Results field and check that results are listed in “c[]”. If not, change them. Step 11: Press [tab] to “OK” and press the center click key. Step 12: Arrow down until you see the “r” and look to the right. The number to the right is the correlation coefficient, r. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Key Concepts, continued A correlation coefficient of –1 indicates a strong negative correlation. A correlation coefficient of 1 indicates a strong positive correlation. A correlation coefficient of 0 indicates a very weak or no linear correlation. The correlation coefficient only assesses the strength of a linear relationship between two variables. The correlation coefficient does not assess causation—that one event causes the other. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Common Errors/Misconceptions using the correlation coefficient to analyze data that is not linear incorrectly using the correlation coefficient to assess the strength of a relationship 4.3.2: Calculating and Interpreting the Correlation Coefficient
Guided Practice Example 1 An education research team is interested in determining if there is a relationship between a student’s vocabulary and how frequently the student reads books. The team gives 20 students a 100-question vocabulary test, and asks students to record how many books they read in the past year. The results are in the table on the next slide. Is there a linear relationship between the number of books read and test scores? Use the correlation coefficient, r, to explain your answer. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Guided Practice: Example 1, continued 4.3.2: Calculating and Interpreting the Correlation Coefficient
Guided Practice: Example 1, continued Create a scatter plot of the data. Let the x-axis represent books read and the y-axis represent test score. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Guided Practice: Example 1, continued Test score Books read 4.3.2: Calculating and Interpreting the Correlation Coefficient
Guided Practice: Example 1, continued Describe the relationship between the data using the graphical representation. It appears that the higher scores were from students who read more books, but the data does not appear to lie on a line. There is not a strong linear relationship between the two events. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Guided Practice: Example 1, continued Calculate the correlation coefficient on your graphing calculator. Refer to the steps in the Key Concepts section. The correlation coefficient, r, is approximately 0.48. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Guided Practice: Example 1, continued Use the correlation coefficient to describe the strength of the relationship between the data. A correlation coefficient of 1 indicates a strong positive correlation, and a correlation of 0 indicates no correlation. A correlation coefficient of 0.48 is about halfway between 1 and 0, and indicates that there is a weak positive linear relationship between the number of books a student read in the past year and his or her score on the vocabulary test. ✔ 4.3.2: Calculating and Interpreting the Correlation Coefficient
Guided Practice: Example 1, continued 4.3.2: Calculating and Interpreting the Correlation Coefficient
Guided Practice Example 3 Caitlyn thinks that there may be a relationship between class size and student performance on standardized tests. She tracks the average test performance of students from 20 different classes, and notes the number of students in each class in the table on the next slide. Is there a linear relationship between class size and average test score? Use the correlation coefficient, r, to explain your answer. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Guided Practice: Example 3, continued 4.3.2: Calculating and Interpreting the Correlation Coefficient
Guided Practice: Example 3, continued Create a scatter plot of the data. Let the x-axis represent the number of students in each class and the y-axis represent the average test score. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Guided Practice: Example 3, continued Average test score Number of students 4.3.2: Calculating and Interpreting the Correlation Coefficient
Guided Practice: Example 3, continued Describe the relationship between the data using the graphical representation. As the class size increases, the average test score decreases. It appears that there is a linear relationship with a negative slope between the two variables. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Guided Practice: Example 3, continued Calculate the correlation coefficient on your graphing calculator. Refer to the steps in the Key Concepts section. The correlation coefficient, r, is approximately –0.84. 4.3.2: Calculating and Interpreting the Correlation Coefficient
Guided Practice: Example 3, continued Use the correlation coefficient to describe the strength of the relationship between the data. A correlation coefficient of –1 indicates a strong negative correlation, and a correlation of 0 indicates no correlation. A correlation coefficient of –0.84 is close to –1, and indicates that there is a strong negative linear relationship between class size and average test score. ✔ 4.3.2: Calculating and Interpreting the Correlation Coefficient
Guided Practice: Example 3, continued 4.3.2: Calculating and Interpreting the Correlation Coefficient