1 / 14

Week 6: Model selection

Week 6: Model selection. Overview Questions from last week Model selection in multivariable analysis -bivariate significance -interaction and confounding Discussion of the 3 articles Data analysis discussion. Univariate, bivariate, and multivariate analysis: a review.

trapper
Download Presentation

Week 6: Model selection

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Week 6: Model selection Overview Questions from last week Model selection in multivariable analysis -bivariate significance -interaction and confounding Discussion of the 3 articles Data analysis discussion

  2. Univariate, bivariate, and multivariate analysis: a review

  3. Univariate, bivariate, and multivariate analysis: a review

  4. Univariate, bivariate, and multivariate analysis: a review

  5. Back to the mathematical model • In linear regression Y’ (known as Y prime) is the predicted value on the outcome variable • A is the Y axis intercept • β1 is the coefficient assigned through regression • X1 is the unit of the exposure variable • For logistic regression the model is: • ln ( Y’ ) =A + β1X1 + β2X2 +β3X3 • 1-Y’

  6. Model selection • A ‘full’ model is one that includes all the variables • A ‘null’ model is one that includes only the intercept • Selection of which variables to include can be done by you, by the computer, or both • Types of selection: • Forward, backward, stepwise

  7. Backward selection • Starts with a full model • Removes variables starting with the least significant variable • Often the best approach to start with

  8. What do you get when you cross a statistician with a chiropractor? • You get an adjusted R squared from a BACKward regression problem!

  9. Forward selection • Starts with a null model • Enters the variables into the model starting with the most significant • Can miss important associations or interactions

  10. Stepwise selection • Starts with a full or null model (usually a full model or backwards stepwise) • Adds or removes variables based on their significance in the model • Looks at variable itself and the relationship with other in the model • Can be considered the best automatic model selection especially with many exposure variables

  11. Maximum likelihood model fitting • Most logistic regression models use the maximum likelihood model to fit regression models • The log-likelihood is calculated based on predicted and actual outcomes A good model has a NON-significant LL • A goodness-of-fit chi-square is calculated (usually compares a constant-only model to the one you created)-2LL in null model - -2LL in your model with df = number of exposure variable • A good model has a significant goodness of fit

  12. Linear regression model fitting • Uses the same principles as logistic regression • Often starts with a full model • You need to examine 2 things:-the r2 and adjusted r2-changes in significance of each variable as the model changes • The goal is to achieve the model with the highest adjusted r2

  13. Confounding and effect modification • Confounding is classified as a variable that is associated with the exposure variable and the outcome variable, but is not on the causal pathway • E.g. smoking can be a confounding variable in the relationship between drinking alcohol and oral cancer • Effect modification is when the variable has a different effect in subgroups of the population • E.g., the effectiveness of a form to reduce medication errors can depend on whether the form is for home or the ED • These need to be considered when fitting a regression model

  14. For next week • Read articles • Start modelling your own data using the appropriate multivariable technique • Think about model selection, interactions and possibility of confounding

More Related