190 likes | 203 Views
Multilevel Modelling of PLASC data. Harvey Goldstein University of Bristol. Data structure. Students nested within schools Students changing schools over time Students moving between stages (e.g. Junior – Secondary) Students nested within neighbourhoods
E N D
Multilevel Modelling of PLASC data Harvey Goldstein University of Bristol
Data structure • Students nested within schools • Students changing schools over time • Students moving between stages (e.g. Junior – Secondary) • Students nested within neighbourhoods • Students moving between neighbourhoods • Statistical modelling requires use of cross-classifications and multiple membership models
Secondary Primary Student Multiple membership School Student A classification diagram Simple nesting School Student Crossing
PLASC classification structure Primary Secondary Neighbourhood Student • Selection of pupils to study: • By area (LEA, Region) • By type (e.g. ethnic group) • By type of school • In all cases we need to consider sampling the total population, movement of pupils in and out of areas, types of school etc.
Case study: all students in an LEA Issues: • How to deal with movement in and out (mobility) • How to measure time in each school • How to deal with ‘special’ schools • How to measure and use composition of school • How well does a ‘simple’ analysis approximate a full ‘structural’ model ?
Data and Mobility (turbulence) • Study of Secondary years 7 – 9. KS3 scores as outcome; KS2 as predictor; FSM, gender, ethnic group. • Mobility is movement between secondary schools (and possibly neighbourhoods) • We compare MM model with standard model using only KS3 school • We also study cross classification of KS2 school by secondary school(s).
Time in each school • Multiple membership requires us to weight school level contribution in proportion to time in each school. • PLASC only allows us to approximate these. • We know school each January and at KS2 and at KS3 • Units of measurement are 0.4, 0.6, 1.0 year.
‘Special’ schools • Some schools are very small and may take just SEN pupils for example. • We can deal with them as a separate category – we have in fact excluded them since they are few in number • Including them greatly increases the between school variation
Compositional effects • average KS2 scores of KS3 cohort • proportion FSM eligible for KS3 cohort • Proportion of minority ethnic groups • Average KS2 score for those remaining in KS3 school years 7 – 9; average for those not • Proportion of girls averaged over years 7 – 9 that student encounters in schools attended
Some results for MathsIntercept and KS2 only – Hampshire years 7-9 Simple Models with no MM using KS3 school + various predictors – no cohorts < 30 Note: correlation between residuals using Yr 7 rather than Yr 9 as ID is 0.90
Multiple membership model Also includes non-Hampshire schools years 7 & 8
Exclusions • We have 13 pupils who are in secondary schools outside Hampshire in yr 7 and year 8 where these schools do not appear elsewhere in data set. These have been excluded from the analysis as has 1 student who is not in Hampshire school in year 9. • We have also excluded pupils (200 or so) who are in 146 schools that do not appear to have Hamshire secondary IDs ( i.e. not beginning with 850). This leaves 170 schools.
MM vs simple hierarchy • Level 2 variance increases by about 11% using MM model. For these data the % students not in same school years 7-9 is about 5%. • Now look at more complex models • These cross-classify students by Yr 7 school – subscript v is primary, u is secondary
Intcpt + KS2 score + XC primary No Multiple membership. Adding mean KS2 score for class student was in at year 7 (s_ks2mean_po2), FSM eligibility at year 9 and year 7 (fsm_po4_1, fsm_po2_1) and gender (boy)
Inferences • Note primary school variance about twice that for secondary: primary schools are smaller and more homogeneous. Similar results in other studies • Now MM model increase in level 2 variance is about 5%. • Note significant effects of FSM status at both ages – 5% change status.
FSM YR9 x YR 7 • 0 1 TOTALS • 0 11843 307 12150 • 1 335 571 906 • TOTALS 12178 878 13056
Conclusions • Ignoring multiple membership where % mobility about 5% implies underestimation of level 2 variance up to ~ 11% • Adding primary school identification reduces level 2 secondary variance by ~ 15% but gives primary school variance up to twice secondary school variance. • The estimated school effects will also change.