1 / 18

Introduction to logistic regression

Introduction to logistic regression. When and why to use Logistic Regression?. The response variable has to be binary or ordinal. Predictors can be continuous, discrete, or combinations of variables. Predictors do not have to be normally distributed

sitara
Download Presentation

Introduction to logistic regression

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Introduction to logistic regression

  2. When and why to use Logistic Regression? • The response variable has to be binary or ordinal. • Predictors can be continuous, discrete, or combinations of variables. • Predictors do not have to be normally distributed • Predictors does not have to be linearly related. • estimated probabilities lie between 0 and 1. • Non-linear relationships between the response and predictors

  3. When and why to use Logistic Regression? • A non-parametric method that requires no specific distribution of the errors. • Offers easy model-building or variable selection procedures. • Parameter estimates are obtained by maximum likelihood methods

  4. { logit of y The Logistic Model • If (p) is the probability of the event , then odds of the event is : • The simple logistic model is based on a linear relationship between natural logarithm (ln) of the odds of an event and independent variables

  5. The Logistic Model • Using the laws of exponents and logs to express (p) in terms of L : Logistic Function and :

  6. The Logistic Model probability (s shaped curve) L=ln(o)

  7. Basic interpretation of . • When x1 = x and x2 = x +1, then the log odds changes by  amount • which means that, the odds becomes exp() times the original.

  8. Data Form The data could be collected either : • Individually ( binary data ) • As a group (if there are more observations on each x value) In this case, it is sufficient to report the total number of ‘1’s at each x value .

  9. Example 1: binary data

  10. Example 1: binary data OR P(mastitis)=1

  11. For age 22 month

  12. OR compares the odds of an event for two cows, one with and the one with With X values 1 unit apart :

  13. Odds ratios • Odds ratios range from 0 to positive infinity • O R < 1 = (P) < .50 , • OR > 1 = ( p ) > .50 .

  14. Deviance • Measure of deviation between the estimated and observed values • analogous to SS residual for linear model

  15. Example 2 : grouped data

  16. 95% confidence limits ? 95% confidence interval around the odds ratio :

  17. Summary of main points • Logistic regression model is used to analyze the association between a binary outcome and one or many determinants. • The determinants can be binary, categorical or continuous measurements • The model is logit (p) = log[p / (1-p)] = a + bX where X is a factor, and a and b must be estimated

  18. Thank you for your attention

More Related