1 / 7

Multiple Regression in Practice

Multiple Regression in Practice. The value of outcome variable depends on several explanatory variables. F -test. To judge whether the explanatory variables in the model adequately describe the outcome variable.

kagami
Download Presentation

Multiple Regression in Practice

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Multiple Regression in Practice • The value of outcome variable depends on several explanatory variables. • F-test. To judge whether the explanatory variables in the model adequately describe the outcome variable. • t-test. Applies to each individual explanatory variable. Significant t indicates whether the explanatory variable has an effect on outcome variable while controlling for other X’s. • T-ratio. To judge the relative importance of the explanatory variable.

  2. Basic Assumptions • Mean value of the outcome variable for a set of explanatory variables is described by the regression equation. • Normal distribution of values around the regression line. • Variance around the regression line is the same for all values of the explanatory variables. • The explanatory variables are not correlated.

  3. Problem of Multicollinearity • When explanatory variables are correlated there is difficulty in interpreting the effect of explanatory variables on the outcome. Check by: • Correlation coefficient matrix (see next slide). • F-test significant with insignificant t. • Large changes occur in the regression coefficients when variables are added or deleted. (Variance Inflation). Vi > 4 or 5 means there is multicollinearity.

  4. Example of a Matrix Plot • This matrix plot comprises several scatter plots to provide visual information as to whether variables are correlated • The arrow points at a scatter plot where two explanatory variables are strongly correlated

  5. Selecting the most Economic Model The purpose is to find the smallest number of explanatory variables which make the maximum contribution to the outcome. • After excluding variables that may be causing multicollinearity, examine the table of t-ratios in the full model. Those variables with a significant t are included in the sub-set. • In the Analysis of Variance table examine the column headed SEQ SS. Check that the candidate variables are indeed making a sizable contribution to the Regression Sum of Squares

  6. Stepwise Regression Analysis • Stepwise finds the explanatory variable with the highest R2 to start with. It then checks each of the remaining variables until two variables with highest R2 are found. It then repeats the process until three variables with highest R2 are found, and so on. • The overall R2 gets larger as more variables are added. • Stepwise may be useful in the early exploratory stage of data analysis, but not to be relied upon for the confirmatory stage.

  7. Is the Model Adequate? Judged by the following: • R2 value. Increase in R2 on adding another variable gives a useful hint • Adjusted R2 is a more sensitive measure. • Smallest value of s (standard deviation). • C-p statistic. A model with the smallest C-p is used such that Cp value is closest to p (the number of parameters in the model)

More Related