1 / 16

Virtual COMSATS Inferential Statistics Lecture-25

Recap of previous lectures on hypothesis testing, introduction to correlation and regression, simple correlation and its significance, properties of correlation, scatter plot, coefficient of determination, regression analysis, hypothesis testing for correlation coefficient.

lwakefield
Download Presentation

Virtual COMSATS Inferential Statistics Lecture-25

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Virtual COMSATSInferential StatisticsLecture-25 Ossam Chohan Assistant Professor CIIT Abbottabad

  2. Recap of previous lectures • We are working on hypothesis testing. And we discussed following topics in our last lectures: • Introduction to Hypothesis Testing. • Six Steps of hypothesis testing. • Hypothesis testing for single population i.e for mean and proportion. • Hypothesis testing for paired observations. • Chi Square distribution. • Test of independence. • Test for homogeneity. • Test for variances. • Goodness of fit test. • Fisher’s Exact Test. • F-Distribution. • ANOVA • One Way ANOVA. • Two Way ANOVA. • Multiple Comparison using LSD.

  3. Objective of lecture-25 • Introduction of Correlation and Regression. • Simple Correlation and its significance in research. • Solution to problems. • Properties of Correlation. • Scatter Plot. • Coefficient of determination. • Regression Analysis. • Simple Regression model. • Probable error and standard error. • Hypothesis testing for correlation coefficient.

  4. Correlation and Regression • Is there a relationship between x and y? • What is the strength of this relationship • Pearson’s r • Can we describe this relationship and use it to predict y from x? • Regression • Is the relationship we have described statistically significant? • F- and t-tests

  5. Discussion on Correlation

  6. Correlation • Correlation analysis is used to measure strength of the association (linear relationship) between two variables like fertilizer and yield. • Only concerned with strength of the relationship. • No causal effect is implied. • Sample correlation coefficient is represented by r.

  7. Scatter Plot A scatter plot is a graph of a collection of ordered pairs (x , y). A scatter plot (or scatter diagram) is used to show the relationship between two variables. Correlation can be represented using scatter plot. Why Scatterplot?

  8. Scatter Plot Examples Linear relationships Curvilinear relationships y y x x y y x x

  9. Scatter Plot Examples (continued) Strong relationships Weak relationships Can we show it numerically? y y x x y y x x

  10. Scatter Plot Examples (continued) No relationship y x y x

  11. Correlation Coefficient • The population correlation coefficient is shown by a Greek symbol ρ (pronounce it as “rho”). • It measures the strength of the association between the variables. • The sample correlation coefficient r is an estimate of ρ. • It is used to measure the strength of the linear relationship in the sample observations

  12. Properties of ρand r • Unit free • Values always lies between -1 and 1. • The closer to -1, the stronger the negative linear relationship. • The closer to 1, the stronger the positive linear relationship. • The closer to 0, the weaker the linear relationship.

  13. Calculating the Correlation Coefficient Sample correlation coefficient: In algebraic equivalent: where: r = Sample correlation coefficient n = Sample size-no of pairs of values x = Value of the independent variable y = Value of the dependent variable

  14. Problem-27 • The test-Retest method is one way of establishing the reliability of a test. The test is administered and then, at a later date, the same test is re-administered to the same individuals. Find the correlation coefficient between two sets of scores.

  15. Problem-27 Solution

  16. Coefficient of Determination, R2 • The coefficient of determination is the portion of the total variation in the dependent variable that is explained by variation in the independent variable • The coefficient of determination is also called R-squared and is denoted as R2 where Note: In the single independent variable case, the coefficient of determination is where: R2 = Coefficient of determination r = Simple correlation coefficient

More Related