330 likes | 404 Views
Nonlinear Trend in Inequality of Educational Opportunity in the Netherlands 1930-1989. Maarten L. Buis Harry B.G. Ganzeboom. Outline. Background and research problem Main results Model selection Continuous or discrete measures parental education and father’s occupational status
E N D
Nonlinear Trend in Inequality of Educational Opportunity in the Netherlands 1930-1989 Maarten L. Buis Harry B.G. Ganzeboom
Outline • Background and research problem • Main results • Model selection • Continuous or discrete measures parental education and father’s occupational status • Importance mother’s education relative to father’s education • Difference in effect between sons and daughters • Non-linearity in trend in effects: identify periods of negative, positive, and no trend.
Historical / biographical background • Previous studies of trends in IEO in the Netherlands (NO CHANGE): • Dronkers et al. student cohorts (at age 12) 1965, 1973, 1981, 1989. • Peschar et al. on synthetic cohort one single survey (NPAO 1982). • De Graaf & Ganzeboom (1990, 1993) on synthetic cohorts in 10+ surveys: DECLINE.
The historical trend • Confirmed: • With linear regression • With loglinear models (uniform association, scaled association (RC-2). • With ordered logits • With sequential logits (transition model) [Shavit & Blossfeld 1993]: Dutch exceptionalism? Aggressive welfare state policies?
Our explanation • Use a lot of data, pooled surveys • Wider time window • More statistical power • Smoothing of survey peculiarities • Concentrate on global distribution of education (not transitions) • But note: • The estimated trend is far from trivial or small (-1% per year)
The problem • Trend towards less IEO is well documented. • Even the most recent accounts (Ganzeboom & Luijkx, 2004a, 2005b) find a linear trend. • However, there is reason to believe that the trend cannot continue. • When do we begin to observe a deceleration of the trend?
Main results • Model of IEO: • distinction between highest and lowest educated parent is more important than distinction between father and mother, or same-sex-parent. • Effects of parental education and father’s occupational status is the same for sons and daughters. • Non-linearity in trend • Effect of father’s status decreases non-linearly over time, slowing down significantly around 1970. • Significance was determined with parametric bootstrap of Lowess-curves.
Data • International Stratification and Mobility File (ISMF) – on the Netherlands • 25 surveys held between 1958 and 2003 with information on cohorts 1930-1989. • 80,000 respondents aged between 24 and 65, of which 40,000 have complete information on child's, father’s and mother’s education and father's occupation. • Number of cases are unequally distributed over cohorts.
Model 1: linear regression • Dependent variable is level of education and treated as continuous. • Parental education is either entered as father’s and/or mother’s education, highest and/or lowest educated parent, or education of same sex parent • Father’s occupational status is measured in ISEI scores • Trend in effects are measured as third order orthogonal polynomials or lowess curves.
Two objections against linear education • Regression coefficient is affected by both ‘real’ effects of parental characteristics on probabilities of making transitions and expansion of the educational distribution • True, if education is studied as a process • False, if education is studied as an outcome • education is discrete • this does not have to be a problem if there is no concentration in the lowest or highest category
Model 2:Stereotype Ordered Regression (SOR) • SOR allows for unordered dependent variable • SOR will estimate an optimal scaling of education and the effect of independent variables on this scaled education. • The dependent variable is nominal, and SOR reveals latent ordering based on associations with independent variables.
Model 3: Row Column Association Model II (RC2) • Objection against use of ISEI: • Effect of father’s occupation is better represented by small number of discrete classes, rather than on continuous scale. • Classes used are EGP classification. • RC2 is an extension of SOR as it also estimates an optimal scaling for FEGP
Father’s and mother’s education • Conventional model: Only father matters • Individual model: Both mother and father matter • Joint model: Effect of father and mother are equal • Dominance model: Highest educated parent matters • Modified Dominance model: Highest and lowest educated parent matter • Sex Role model: Same sex parent matters
Identifying periods with significant trend • A negative slope means a negative trend. • A positive slope means a positive trend. • A zero slope means no trend, or not enough information.
Identifying periods with significant change in trend • An accelerating trend means that a negative trend becomes more negative, so a negative change in slope. • A decelerating trend means that a negative trend becomes less negative, so a positive change in slope. • A constant trend means no change in slope.
Data • The ISMF dataset is converted into three new datasets, containing estimates of the association between father’s occupational status and child’s education for 60 annual cohorts. • One dataset for each technique. • The precision of the estimates (the standard error) is used to weigh the cohorts (weights are the inverse of error variances).
Lowess curves: locally weighted scatterplot smooth • We have a dataset consisting of estimates of IEO for each annual cohort which used only information from that cohort • If we think that IEO develops like a smooth curve over time, than nearby estimates also contain relevant information. • The lowess curve creates an improved estimate of the IEO for each cohort using information from nearby cohorts. • It results in a smooth line by connecting the lowess estimates. • Estimates of the trend and change in trend at each cohort can also be obtained from this curve.
Lowess curve in 1949 • Point on lowess curve in 1949 • Select closest 60% of the points. • Give larger weights to nearby points. • Adjust weights for precision of estimated IEO. • WLS regression of IEO on time, time squared and time cubed on weighted points. • Predicted value in 1949, is smoothed value of 1949. • First derivative in 1949 is trend in 1949. • Second derivative in 1949 is change in trend in 1949. • Repeat for all cohorts and all techniques and connect the dots.
Selecting spans • Percentage closest points (span) determines the smoothness of the lowess curve. • Trade-off between smoothness and goodness of fit. • Can be judged visually by comparing lowess curves with different spans. • Numerical representations of this trade-off are Generalized Cross Validation, and Akaike Information Criterion. • Lower values mean a better trade-off.
Bootstrap confidence intervals • Confidence interval gives the range of results that could plausibly occur just through sampling error. • Make many `datasets' that could have occurred just by sampling error. • Fit lowess curves through each `dataset'. • The area containing 90% of the curves is the 90% confidence interval. • The estimates of IEO are regression, SOR, RC2 coefficients with standard errors. • The standard error gives information about what values of IEO could plausibly occur in a `new' dataset.
OLS SOR RC2