1 / 17

Fighting for fame, scrambling for fortune, where is the end?

Fighting for fame, scrambling for fortune, where is the end? Great wealth and glorious honor, no more than a night dream. Lasting pleasure, worry-free forever, who can attain?. Categorical Data Analysis. Chapter 4: Introduction to Generalized Linear Models. 3-Way Tables.

lacey
Download Presentation

Fighting for fame, scrambling for fortune, where is the end?

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Fighting for fame, scrambling for fortune, where is the end? Great wealth and glorious honor, no more than a night dream. Lasting pleasure, worry-free forever, who can attain?

  2. Categorical Data Analysis Chapter 4: Introduction to Generalized Linear Models

  3. 3-Way Tables

  4. Marginal vs. Conditional • Marginal independence: marginal odds ratio =1 • Conditional independence: conditional odds ratios =1 • The observed effect of X on Y might simply reflect effects of other covariates on both X and Y

  5. Generalized Linear Models (GLM) 3 components: • Random component (Y1, Y2, … , Yn): n independent observations from a distribution in the exponential family (not necessary to be i.i.d.): for i=1,2,…,n, where a, b, c are all positive functions. e.g. poisson, binomial, exponential, normal

  6. Systematic component: the right hand side of the model equation; often a linear combination of explanatory variables: Or in matrix format or X’B

  7. Link component: the link between u(=E(Y)) and X’B In a model equation g(u)=X’B, g(.) is called the link function, a monotonic differentiable function • Most common link: Canonical link e.g. normal, binomial, poisson

  8. Canonical Link • Normal data: identity link • Binary data: logit link • Count data: log link

  9. Deviance • Deviance measures the loss of information from data reduction • Saturated model: the most general model which fits each observation • Could be each subject’s response or • Could be each cell frequency

  10. GLMs for Binary Data • Linear probability model (identity link) • Logistic regression model (logit link) • Probit model (probit link)

  11. Example: Snoring vs. Heart Disease Note: The fits will change if relative spacings between scores change.

  12. Example: 2x2 Tables • Binary covariate X and response Y • Logit link GLM:

  13. GLMs for Count Data • Poisson loglinear model • Count data: certain events occur over time, space or alike, e.g. the # of car accidents of a random sample of 100 drivers in 2005 • Rate data: count/(time or space or alike), e.g. the car accident rates of a random sample of 100 drivers in 2005 (Sec. 9.7.1, p. 385)

  14. Example: Horseshoe Crabs Data: Table 4.3 (p. 127) • Y= # of satellites • X= carapace width • Poisson loglinear model • Poisson GLM with identity link

  15. Inference for GLMs • Goodness of fit: • Measure: deviance • Test: Likelihood-Ratio (LR) tests • Model comparison: L-R tests • Residuals: Pearson and standardized

  16. Example: Insecticide vs Beetles

  17. Types of Models for Statistical Analysis

More Related