
Research Method

Research Method. Lecture 5 (Ch6) Multiple regression: Further issues. Effects of data scaling on OLS statistics. Consider the following labor supply equation for married women: Hours = β₀ + β₁(non-wife income) + β₂(experience) + β₃(education) + β₄(# kids aged less than 6) + u

akira

Presentation Transcript


  1. Research Method Lecture 5 (Ch6) Multiple regression: Further issues

  2. Effects of data scaling on OLS statistics. • Consider the following labor supply equation for married women: Hours = β₀ + β₁(non-wife income) + β₂(experience) + β₃(education) + β₄(# kids aged less than 6) + u. Hours: the woman's annual hours worked. Non-wife income: the wife's non-labor income (such as the husband's income), in $1000. Estimate the model using MROZ.dta.

  3. Question: What is the effect of measuring non-wife income in $1 instead of in $1000? • What is the effect of measuring the wife's labor supply in days-equivalent units (i.e., days = hours/8)?
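The two rescaling questions can be explored numerically before turning to MROZ.dta. The sketch below uses simulated data rather than the real file; the variable names (`nwifeinc_k`, `exper`, `hours`), the data-generating numbers, and the `ols` helper are all illustrative assumptions, not part of the lecture. It suggests that rescaling a regressor rescales only its own coefficient and standard error, and that rescaling the dependent variable rescales every coefficient and standard error, leaving t-statistics unchanged in both cases.

```python
import numpy as np

def ols(X, y):
    """OLS coefficients and standard errors; X must already contain a constant column."""
    n, p = X.shape
    Q, R = np.linalg.qr(X)                  # QR is numerically safer than inverting X'X
    beta = np.linalg.solve(R, Q.T @ y)
    resid = y - X @ beta
    sigma2 = resid @ resid / (n - p)        # SSR / (n - k - 1)
    R_inv = np.linalg.inv(R)
    se = np.sqrt(sigma2 * np.sum(R_inv**2, axis=1))  # sqrt of diag of sigma2 * (X'X)^-1
    return beta, se

# Simulated (hypothetical) data standing in for MROZ.dta
rng = np.random.default_rng(0)
n = 500
nwifeinc_k = rng.uniform(5, 60, n)          # non-wife income in $1000
exper = rng.uniform(0, 30, n)
hours = 1500 - 5 * nwifeinc_k + 20 * exper + rng.normal(0, 300, n)
const = np.ones(n)

# Baseline: income in $1000, labor supply in hours
b_k, se_k = ols(np.column_stack([const, nwifeinc_k, exper]), hours)
# Income measured in $1 instead of $1000 (regressor multiplied by 1000)
b_d, se_d = ols(np.column_stack([const, nwifeinc_k * 1000, exper]), hours)
# Labor supply measured in days (dependent variable divided by 8)
b_day, se_day = ols(np.column_stack([const, nwifeinc_k, exper]), hours / 8)

print("income coefficient, $1000 vs $1:", b_k[1], b_d[1])
print("t-stat on income, $1000 vs $1:  ", b_k[1] / se_k[1], b_d[1] / se_d[1])
print("coefficients, hours vs days:    ", b_k, b_day)
```

With income in $1, the income coefficient and its standard error are both 1000 times smaller, while the other coefficients do not move. With days as the dependent variable, every coefficient and standard error is divided by 8.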

  4. Standardizing a variable • The effects of some variables, such as test scores, are often difficult to interpret. For example, what does a 1-point increase in a test score mean for wage? • In such cases, it is often better to standardize the variable first and then include it in the model. That is, include the following variable instead: z = (test score − mean of test score) / (standard deviation of test score).

  5. The coefficient on the standardized variable then tells you the effect when the test score increases by one standard deviation.
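A small simulation can make the standardized coefficient concrete. This is a sketch on made-up data: the `score` and `wage` variables and their parameters are assumptions for illustration only. It checks that the coefficient on the standardized score equals the raw coefficient multiplied by the sample standard deviation of the score.

```python
import numpy as np

def ols(X, y):
    """Return OLS coefficients; X must already contain a constant column."""
    return np.linalg.lstsq(X, y, rcond=None)[0]

# Simulated (hypothetical) test scores and wages
rng = np.random.default_rng(1)
n = 400
score = rng.normal(500, 100, n)
wage = 5 + 0.01 * score + rng.normal(0, 2, n)
const = np.ones(n)

# Standardize: subtract the sample mean, divide by the sample standard deviation
sd_score = score.std(ddof=1)
z = (score - score.mean()) / sd_score

b_raw = ols(np.column_stack([const, score]), wage)   # effect of 1 point
b_std = ols(np.column_stack([const, z]), wage)       # effect of 1 standard deviation

print("effect of one point:             ", b_raw[1])
print("effect of one standard deviation:", b_std[1])
print("raw effect times sd(score):      ", b_raw[1] * sd_score)
```

Because z has mean zero, the intercept in the standardized regression equals the sample mean of wage.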

  6. Example: Using HPRICE2.dta, estimate the effect of the environment (measured by nox) on housing prices. Standardize nox before including it in the model. Also include (crime), (dist), (rooms) and (stratio) in the model.

  7. Including quadratic terms • Often you would like to capture ‘diminishing marginal returns’. For example, the effect of experience on wage may exhibit diminishing marginal returns. • To capture such effects, include a quadratic term: Log(wage) = β₀ + β₁(experience) + β₂(experience)² + u. If β₁ is positive and β₂ is negative, experience has diminishing marginal returns.

  8. Note that the effect of experience is given by ∂log(wage)/∂(experience) = β₁ + 2β₂(experience). So, the effect changes with experience. Exercise 1: Using MROZ.dta, estimate the above equation. What is the effect on wage of increasing experience by 1 year, evaluated at the average experience? Exercise 2: What is the effect on wage of increasing experience by 1 year for those with experience equal to 1?
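The marginal-effect formula for the quadratic model can be evaluated as follows. This is a sketch on simulated data, not MROZ.dta; the variable names and the true parameter values are assumptions chosen only so that the quadratic term is negative.

```python
import numpy as np

# Simulated (hypothetical) data with diminishing returns to experience
rng = np.random.default_rng(2)
n = 1000
exper = rng.uniform(0, 40, n)
lwage = 1.0 + 0.04 * exper - 0.0008 * exper**2 + rng.normal(0, 0.3, n)

# Regress log(wage) on a constant, experience, and experience squared
X = np.column_stack([np.ones(n), exper, exper**2])
b0, b1, b2 = np.linalg.lstsq(X, lwage, rcond=None)[0]

def marginal_effect(x):
    """Estimated d log(wage) / d experience at experience level x."""
    return b1 + 2 * b2 * x

me_mean = marginal_effect(exper.mean())   # evaluated at average experience
me_one = marginal_effect(1.0)             # evaluated at experience = 1
turning_point = -b1 / (2 * b2)            # experience level where the effect reaches zero

print("effect at average experience:", me_mean)
print("effect at experience = 1:    ", me_one)
print("turning point (years):       ", turning_point)
```

Multiplying a marginal effect by 100 gives the approximate percentage change in wage from one more year of experience. With a negative quadratic coefficient, the effect at experience = 1 is larger than the effect at the average.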

  9. Interaction terms • Suppose you estimate the following production function: Log(Q) = β₀ + β₁log(labor) + β₂log(capital) + u. This model assumes that the effect of labor on output does not depend on the amount of capital. But workers may be more productive if they have more capital. So in reality, there may be an interaction effect between labor and capital.

  10. To capture such interaction effects, you can include an interaction term: Log(Q) = β₀ + β₁log(labor) + β₂log(capital) + β₃log(labor)log(capital) + u. The effect of labor on output is now given by ∂log(Q)/∂log(labor) = β₁ + β₃log(capital). So the effect of labor depends on the amount of capital.
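The labor effect in the interaction model can be computed at different capital levels as sketched below. The data are simulated, and the names `log_l`, `log_k`, `log_q` and the parameter values are illustrative assumptions.

```python
import numpy as np

# Simulated (hypothetical) production data in logs
rng = np.random.default_rng(3)
n = 800
log_l = rng.normal(3, 1, n)
log_k = rng.normal(4, 1, n)
log_q = (0.5 + 0.4 * log_l + 0.3 * log_k
         + 0.05 * log_l * log_k + rng.normal(0, 0.2, n))

# Regress log(Q) on log(labor), log(capital), and their product
X = np.column_stack([np.ones(n), log_l, log_k, log_l * log_k])
b = np.linalg.lstsq(X, log_q, rcond=None)[0]

def labor_elasticity(log_capital):
    """Estimated d log(Q) / d log(labor) at a given level of log(capital)."""
    return b[1] + b[3] * log_capital

at_mean = labor_elasticity(log_k.mean())
print("interaction coefficient:              ", b[3])
print("labor effect at average log(capital): ", at_mean)
print("labor effect at log(capital) = 3 and 5:", labor_elasticity(3.0), labor_elasticity(5.0))
```

A positive interaction coefficient means the labor effect rises with capital. The same pattern applies to the house-price exercise: evaluate β₁ + β₃(bdrms) at the average number of bedrooms.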

  11. Exercise: Using HPRICE1.dta, estimate the following model: Price = β₀ + β₁sqrft + β₂bdrms + β₃(sqrft)(bdrms) + u. Question 1: Is there a positive or negative interaction effect between the size of the house and the # of bedrooms? Question 2: What is the effect of (sqrft) on the price of a house, evaluated at the average # of bedrooms? Question 3: What is the effect of (bdrms) on the price of a house, evaluated at the average size of house?

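  12. Adjusted R-squared • The usual R-squared is given by R² = 1 − (SSR/n)/(SST/n), which is equal to 1 − SSR/SST. • The adjusted R-squared is given by adjusted R² = 1 − [SSR/(n−k−1)]/[SST/(n−1)]. The two are almost the same but different: the adjusted version divides SSR and SST by their degrees of freedom instead of by n. Here SSR is the residual sum of squares, SST is the total sum of squares, n is the number of observations, and k is the number of explanatory variables.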

  13. The usual R-squared never decreases when you add additional variables (even if it does not make sense to add them). • The adjusted R-squared imposes a penalty for adding additional variables because it divides SSR by (n−k−1), where n is the number of observations and k is the number of explanatory variables. • The adjusted R-squared can also be written as adjusted R² = 1 − (1 − R²)(n−1)/(n−k−1).
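Both statistics are easy to compute by hand. The sketch below uses simulated data; the `fit_stats` helper and the pure-noise regressor `junk` are hypothetical names introduced for illustration. It compares a model with and without an irrelevant variable and checks the alternative formula for the adjusted R-squared.

```python
import numpy as np

def fit_stats(X, y):
    """Return (R-squared, adjusted R-squared); X must contain a constant column."""
    n, p = X.shape
    k = p - 1                                   # number of slope coefficients
    beta = np.linalg.lstsq(X, y, rcond=None)[0]
    resid = y - X @ beta
    ssr = resid @ resid                         # residual sum of squares
    sst = np.sum((y - y.mean()) ** 2)           # total sum of squares
    r2 = 1 - ssr / sst
    adj_r2 = 1 - (ssr / (n - k - 1)) / (sst / (n - 1))
    return r2, adj_r2

# Simulated (hypothetical) data: y depends on x1 only; junk is pure noise
rng = np.random.default_rng(5)
n = 100
x1 = rng.normal(size=n)
junk = rng.normal(size=n)
y = 1 + 2 * x1 + rng.normal(size=n)
const = np.ones(n)

r2_small, adj_small = fit_stats(np.column_stack([const, x1]), y)
r2_big, adj_big = fit_stats(np.column_stack([const, x1, junk]), y)

# Alternative formula, here with k = 2 regressors in the larger model
adj_alt = 1 - (1 - r2_big) * (n - 1) / (n - 2 - 1)

print("R-squared without / with junk:         ", r2_small, r2_big)
print("adjusted R-squared without / with junk:", adj_small, adj_big)
print("adjusted R-squared, alternative form:  ", adj_alt)
```

The usual R-squared cannot fall when `junk` is added. The adjusted R-squared rises only if the added variable's t-statistic exceeds 1 in absolute value, so whether it falls here depends on the random draw.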

  14. Controlling for too many factors in regression • After learning about omitted variable bias, one may be tempted to control for as many factors as possible. • But this often leads students to control for factors that shouldn’t be controlled for. • The next slide shows an example.

  15. Suppose you would like to see the effect of a beer tax on traffic fatalities. The idea is that a beer tax will reduce beer consumption, which will lead to fewer fatalities. So you may estimate (Fatalities) = β₀ + β₁(beer tax) + β₂(percentage of males in town) + β₃(percentage of young drivers) + β(other variables) + u.

  16. The question is whether you should also control for beer consumption, as in: (Fatalities) = β₀ + β₁(beer tax) + β₂(beer consumption) + β₃(percentage of males in town) + β₄(percentage of young drivers) + β(other variables) + u. • The answer to this question is NO. A beer tax would affect fatalities mainly by reducing beer consumption. So it does not make sense to hold beer consumption constant when you examine the effect of the beer tax on fatalities.
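A stylized simulation shows what happens when beer consumption is held constant. It assumes, purely for illustration, that the tax affects fatalities only through consumption; all variable names and numbers are made up, and the other control variables are left out to keep the sketch short.

```python
import numpy as np

def ols(X, y):
    """Return OLS coefficients; X must already contain a constant column."""
    return np.linalg.lstsq(X, y, rcond=None)[0]

# Simulated (hypothetical) towns: tax lowers consumption, consumption raises fatalities
rng = np.random.default_rng(4)
n = 5000
beer_tax = rng.uniform(0, 2, n)
beer_cons = 10 - 2.0 * beer_tax + rng.normal(0, 1, n)
fatalities = 1 + 0.5 * beer_cons + rng.normal(0, 1, n)
const = np.ones(n)

# Without controlling for consumption: picks up the full effect of the tax
total_effect = ols(np.column_stack([const, beer_tax]), fatalities)[1]

# Controlling for consumption: the channel through which the tax works is held fixed
partial_effect = ols(np.column_stack([const, beer_tax, beer_cons]), fatalities)[1]

print("tax coefficient without beer consumption:", total_effect)
print("tax coefficient with beer consumption:   ", partial_effect)
```

In this setup the true effect of the tax is (−2.0)(0.5) = −1.0. The first regression recovers something close to that, while the second pushes the tax coefficient toward zero even though the tax does reduce fatalities.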

  17. Different models serve different purposes • Suppose that you would like to estimate the gender salary gap among academic economists. One possible model includes a female dummy together with rank dummies: Log(salary) = β₀ + β₁(Female) + β₂(FullProf) + β₃(AssocProf) + (other variables) + u.

  18. The above model estimates the gender salary gap while holding rank constant. • Thus, the female coefficient captures the gender salary gap within each rank. • However, females may be discriminated against in terms of promotions as well, and this would indirectly cause a gender salary gap.

  19. If you would like to evaluate the gender salary gap that is caused by (i) discrimination in terms of salary and (ii) discrimination in terms of promotion combined, it makes sense to drop the rank variables from the model. • When you drop the rank variables (i.e., FullProf and AssocProf), the female coefficient will show the salary gap that is caused by salary discrimination and promotion discrimination combined.
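The two models can be compared on simulated data. In this sketch the promotion probabilities, salary parameters, and variable names are all hypothetical. The female coefficient without rank dummies equals the within-rank coefficient plus a promotion component, which is the standard omitted-variable algebra and holds exactly in any sample.

```python
import numpy as np

def ols(X, y):
    """Return OLS coefficients; X must already contain a constant column."""
    return np.linalg.lstsq(X, y, rcond=None)[0]

# Simulated (hypothetical) economists: women are less likely to be full professors
rng = np.random.default_rng(6)
n = 2000
female = (rng.uniform(size=n) < 0.4).astype(float)
u = rng.uniform(size=n)
p_full = 0.40 - 0.15 * female
full = (u < p_full).astype(float)
assoc = ((u >= p_full) & (u < p_full + 0.30)).astype(float)
log_salary = (11.0 - 0.05 * female + 0.40 * full + 0.20 * assoc
              + rng.normal(0, 0.1, n))
const = np.ones(n)

# Model with rank dummies: gap within each rank
b_long = ols(np.column_stack([const, female, full, assoc]), log_salary)
gap_within = b_long[1]

# Model without rank dummies: salary and promotion gaps combined
gap_total = ols(np.column_stack([const, female]), log_salary)[1]

# Promotion component: rank premiums times the gender gap in each rank's share
d_full = ols(np.column_stack([const, female]), full)[1]
d_assoc = ols(np.column_stack([const, female]), assoc)[1]
promotion_part = b_long[2] * d_full + b_long[3] * d_assoc

print("gap holding rank constant:", gap_within)
print("gap without rank dummies: ", gap_total)
print("promotion component:      ", promotion_part)
```

Which coefficient is the right one depends on the question being asked: pay differences among colleagues of the same rank, or the overall gap including unequal promotion.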
