1 / 32

Regression Concept & Examples, Latent Variables, & Partial Least Squares (PLS)

Regression Concept & Examples, Latent Variables, & Partial Least Squares (PLS). Simple Regression Model. Make prediction about the starting salary of a current college graduate Data set of starting salaries of recent college graduates. Data Set. Compute Average Salary.

zia-york
Download Presentation

Regression Concept & Examples, Latent Variables, & Partial Least Squares (PLS)

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Regression Concept & Examples,Latent Variables,&Partial Least Squares (PLS)

  2. Simple Regression Model • Make prediction about the starting salary of a current college graduate • Data set of starting salaries of recent college graduates Data Set Compute Average Salary How certain are of this prediction? There is variability in the data.

  3. Simple Regression Model • The smaller the amount of total variation the more accurate (certain) will be our prediction. • Use total variation as an index of uncertainty about our prediction Compute Total Variation

  4. Simple Regression Model • How “explain” the variability - Perhaps it depends on the student’s GPA Salary GPA

  5. Simple Regression Model • Find a linear relationship between GPA and starting salary • As GPA increases/decreases starting salary increases/decreases

  6. Simple Regression Model • Least Squares Method to find regression model • Choose a and b in regression model (equation) so that it minimizes the sum of the squared deviations – actual Y value minus predicted Y value (Y-hat)

  7. Simple Regression Model a= 4,779 & b = 5,370 A computer program computed these values • How good is the model? • u-hat is a “residual” value • The sum of all u-hats is zero • The sum of all u-hats squared is the total variance not explained by the model • “unexplained variance” is 7,425,926

  8. Simple Regression Model Total Variation = 23,000,000

  9. Simple Regression Model Total Unexplained Variation = 7,425,726

  10. Simple Regression Model • Relative Goodness of Fit • Summarize the improvement in prediction using regression model • Compute R2 – coefficient of determination Regression Model (equation) a better predictor than guessing the average salary The GPA is a more accurate predictor of starting salary than guessing the average R2 is the “performance measure“ for the model. Predicted Starting Salary = 4,779 + 5,370 * GPA

  11. Detailed Regression Example

  12. Data Set

  13. Scatter Plot - GPA vs Salary

  14. Scatter Plot - Work vs Salary

  15. Pearson Correlation Coefficients-1 <= r <= 1

  16. Three Regressions • Salary = f(GPA) • Salary = f(Work) • Salary = f(GPA, Work) • Interpret Excel Output

  17. Interpreting Results • Regression Statistics • Multiple R, • R2, • R2adj • Standard Error Sy • Statistical Significance • t-test • p-value • F test

  18. Regression Statistics Table • Multiple R • R = square root of R2 • R2 • Coefficient of Determination • R2adj • used if more than one x variable • Standard Error Sy • This is the sample estimate of the standard deviation of the error (actual – predicted)

  19. ANOVA Table • Table 1 gives the F statistic • Tests the claim • there is no significant relationship between your all of your independent and dependent variables • The significance F value is a p-value • should reject the claim: • Of NO significant relationship between your independent and dependent variables if p< • Generally  = 0.05

  20. Regression Coefficients Table • Coefficients Column gives • b0 , b1, ,b2 , … , bn values for the regression equation. • The b0 is the intercept • b1value is next to your independent variable x1 • b2 is next to your independent variable x2. • b3 is next to your independent variable x3

  21. Regression Coefficients Table • p values for individual t tests each independent variables • t test - tests the claim that there is no relationship between the independent variable (in the corresponding row) and your dependent variable. • Should reject the claim • Of NO significant relationship between your independent variable (in the corresponding row) and dependent variable if p<.

  22. Salary = f(GPA)

  23. Salary = f(Work)

  24. Salary = f(GPA, Work)

  25. Compare Three “Models”

  26. Latent Variables(Theoretical Entities)

  27. Latent Variables • Latent Variables • Explanatory Variables that are not directly measured • Identified by “Exploratory Factor Analysis” • Confirmed by “Confirmatory Factor Analysis” • Statistical Methods for Latent Variables • Principles Components Analysis • PLS • SEM

  28. Example: Confirmatory Factor AnalysisIntention to Use Travelocity Website

  29. Research Instrument

More Related