1 / 28

Chapter 16

ECON 6002 Econometrics Memorial University of Newfoundland. Qualitative and Limited Dependent Variable Models. Chapter 16. Adapted from Vera Tabakova’s notes. Chapter 16: Qualitative and Limited Dependent Variable Models. 16.1 Models with Binary Dependent Variables

chas
Download Presentation

Chapter 16

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. ECON 6002 Econometrics Memorial University of Newfoundland Qualitative and Limited Dependent Variable Models Chapter 16 Adapted from Vera Tabakova’s notes

  2. Chapter 16: Qualitative and Limited Dependent Variable Models • 16.1 Models with Binary Dependent Variables • 16.2 The Logit Model for Binary Choice • 16.3 Multinomial Logit • 16.4 Conditional Logit • 16.5 Ordered Choice Models • 16.6 Models for Count Data • 16.7 Limited Dependent Variables: Heckman selection model Principles of Econometrics, 3rd Edition

  3. 16.7.6 Sample Selection • Problem: our sample is not a random sample. The data we observe are “selected” by a systematic process for which we do not account • We wonder about the relationship between x and y but data are available only for observations in which another variable, z*, exceeds a certain value • Selection bias occurs when your sample is truncated and the cause of that truncation is correlated with the dependent variable Principles of Econometrics, 3rd Edition

  4. 16.7.6 Heckit • Solution: a technique called Heckit, named after its developer, James Heckman • Heckman was awarded the Nobel Prize in 2000 for this contribution: “for his development of theory and methods for analyzing selective samples”. although his contribution to economics is gigantic (he is considered one of the ten most influential economists alive) • Who did he fly to Stockholm with? Principles of Econometrics, 3rd Edition

  5. 16.7.6 Heckit Other examples of selection issues: • Will GRE scores help us screen MA applicants for 2014? • Does getting married before “shacking up” help keep marriages off divorce proceedings? • Planes coming back from the war with bullet holes? Principles of Econometrics, 3rd Edition

  6. 16.7.6 Sample Selection • For example, you go to George St. to collect data on the drinking habits of MUN students (very convenient but that is why convenience samples are not valid for general inference!) • To the extent that the likelihood of someone being there is somehow related to the number of drinks they are going to have you would have a sample selection issue • We want our sampling to be random or at least due to some exogenous sampling • Endogenous sampling leads to inconsistent and biased estimation Principles of Econometrics, 3rd Edition

  7. 16.7.6 Sample Selection • Sample selection can arise in many settings and for different reasons, so there are many “sample selection models” • For example, selection into the sample may be due to self-selection, with the outcome of interest determined in part by individual choice of whether or not to participate in the activity of interest • That is why you want your census to be compulsory!  Principles of Econometrics, 3rd Edition

  8. 16.7.6 Sample Selection • The Tobit model can be considered a type of basic selection models too • More flexible extensions of Tobit are what most people refer to as sample selection models • A simple extension is to consider a bivariate sample selection model (as labelled by Cameron and Trivedi), which generalizes the Tobit model by introducing a censoring latent variable that differs from the latent variable generating the outcome of interest • Example: there needs to be something else other than the desired number of hours to supply prompting wives to go to work • Amemiya calls this model the Tobit II (while we already know about the Tobit I above) Principles of Econometrics, 3rd Edition

  9. 16.7.6 Sample Selection • Consistent estimation the under sample selection on unobservablesrelieson quite strong distributional assumptions • Experimental would allow us to avoid selection problems by using random assignment to a treatment • However, experiments can be difficult to implement in economics applications for both cost and ethical reasons • The treatment effects approach attempts to apply the experimental approach to observational data (See Cameron and Trivedi MMA, Ch 25) • There is an increasing number of works dealing with this type of approach Principles of Econometrics, 3rd Edition

  10. 16.7.6 Sample Selection • We will focus on this simple Tobit II model/bivariate sample selection model/Heckman model/Heckit/Tobitmodelwithstochastic threshold … • There’s more!!! • Wooldridge calls the model one with a probit selection equation. • Others call this model the generalized Tobit model • Others call it simply “the” selection model but there are may selection models Principles of Econometrics, 3rd Edition

  11. 16.7.6 Sample Selection • Let y∗2 denote the outcome of interest (say the wives’ wages or inour initial example how many hours to work) • Tobit assumes that this outcome is observed if y∗2 > 0 • A more general model uses a different latent variable, y∗1 , such that y∗2 is observed if y∗1 > 0 • y∗1 determines whether to work or not BUT NOT how much to work, y∗2 does Principles of Econometrics, 3rd Edition

  12. 16.7.6 Sample Selection • The Heckit technique famously takes into account that the decision to work may be correlated with the expected wage as in the mroz.dta example • We only observe the wages of women who do work, the non-working wives we also observe but we have no salary for them • If the reason why the decision to work is somehow related to some unobservable characteristic that also affects their wage, we are in trouble • Although the decision to work could be informed by many other things that have nothing to do with the wages ones does earn, wives work if the salary they are offered exceeds their reservation wage…so clearly we have an issue! Principles of Econometrics, 3rd Edition

  13. 16.7.6 Sample Selection Classic application was to labor supply, where • y∗1 is the unobserved desire or propensity to work, • y2 is actual hours worked • See Mroz (1987) Principles of Econometrics, 3rd Edition

  14. 16.7.6a The Econometric Model • The econometric model describing the situation is composed of two equations. The first, is the selection equation/participation equation that determines whether the variable of interest is observed. Principles of Econometrics, 3rd Edition

  15. 16.7.6a The Econometric Model • The second equation is the linear model of interest. It is run only on the observations for which we have information on y Principles of Econometrics, 3rd Edition

  16. 16.7.6a The Econometric Model • The estimated “Inverse Mills Ratio” is • The estimating equation is This helps us cover for the missing information the omitted variable that was biasing OLS A test of whether or not the errors are correlated and sample selection correction is needed can be built as a Wald test of the estimated coefficient of the inverse Mills ratio Principles of Econometrics, 3rd Edition

  17. 16.7.6a The Econometric Model • The estimated “Inverse Mills Ratio” is • The estimating equation is This helps us cover for the missing information the omitted variable that was biasing OLS Both the usual OLS standard errors and heteroskedasticity-robust standard errors reported from the regression if done manually are incorrect, use Heckit software! Principles of Econometrics, 3rd Edition

  18. 16.7.6b Heckit Example: Wages of Married Women Principles of Econometrics, 3rd Edition

  19. 16.7.6b Heckit Example: Wages of Married Women • The maximum likelihood estimated wage equation is The standard errors based on the full information maximum likelihood procedure are smaller than those yielded by the two-step estimation method. Principles of Econometrics, 3rd Edition

  20. 16.7.6 Sample Selection • We use two data different generation processes to explain the decision to work and the wage • But there is a subtle difference relative to the Cragg model • We here have a third (unobservable) element explaining the decision to work • If that element is also in the error of the main equation, we have a problem of sample selection Principles of Econometrics, 3rd Edition

  21. 16.7.6 Sample Selection • Heckitwith normal errors is theoretically identified without any restriction on the regressors • In principle, same regressors can appear in the equations for y∗ 1 and y∗2 • In practice you want exclusion restrictions (something in the participation equation that is not in the outcome equation) • Otherwise you would be relying only on the nonlinearity of the inverse Mills ration for identification and the inverse Mills ratio term is actually approximately linear over a wide range of its argument, leading to multicollinearity issues  • The problem is less severe the better a probit model can discriminate between participants and nonparticipants Principles of Econometrics, 3rd Edition

  22. 16.7.6 Sample Selection • The problem is less severe the better a probit model can discriminate between participants and nonparticipants • You would prefer to use them but it can be very difficult to make defensible exclusion restrictions Principles of Econometrics, 3rd Edition

  23. 16.7.6 Sample Selection • Extensions • Plenty! And variations of the main model • And different names for the same models!  • Example: what is the selection process is not just a binary decision but an ordered one: oheckman (Chiburis, R. and M. Lokshin (2007)) Principles of Econometrics, 3rd Edition

  24. 16.7.6 Sample Selection • Extensions • Heckit handles linear regression models when there is a selection mechanism • However, if the outcome equation involves a dichotomous dependent variable too, we would have a probit selection equation and a probit outcome equation • That ‘double probit model’/‘bivariate probit model with selection’ can be estimated with heckprob in STATA Principles of Econometrics, 3rd Edition

  25. 16.7.6 Sample Selection • Extensions • A simpler version is the bivariate probit (biprobit) Principles of Econometrics, 3rd Edition

  26. Keywords • binary choice models • censored data • conditional logit • count data models • feasible generalized least squares • Heckit • identification problem • independence of irrelevant alternatives (IIA) • index models • individual and alternative specific variables • individual specific variables • latent variables • likelihood function • limited dependent variables • linear probability model • logistic random variable • logit • log-likelihood function • marginal effect • maximum likelihood estimation • multinomial choice models • multinomial logit • odds ratio • ordered choice models • ordered probit • ordinal variables • Poisson random variable • Poisson regression model • probit • selection bias • tobit model • truncated data Principles of Econometrics, 3rd Edition

  27. Further models Survival analysis (time-to-event data analysis) Multivariate probit (biprobit, triprobit, mvprobit)

  28. References Hoffmann, 2004 for all topics Long, S. and J. Freese for all topics Cameron and Trivedi’s book for count data Agresti, A. (2001) Categorical Data Analysis (2nd ed). New York: Wiley.

More Related