420 likes | 564 Views
Spatial Interaction: An Audit of Population Flow Data in the UK. Adam Dennett, Oliver Duke-Williams and John Stillwell School of Geography, University of Leeds Presentation of the 4 th International Conference on Population Geographies, Chinese University of Hong Kong, 10-13 July 2007.
E N D
Spatial Interaction: An Audit of Population Flow Data in the UK Adam Dennett, Oliver Duke-Williams and John Stillwell School of Geography, University of Leeds Presentation of the 4th International Conference on Population Geographies, Chinese University of Hong Kong, 10-13 July 2007
Outline of presentation • Introduction: relevant background on interaction data and CIDER. Why the need for an audit of interaction data? • Audit of Interaction Data Sources: a whistle-stop tour of interaction data in the UK. What datasets are available and what are the characteristics, advantages, shortcomings and accessibility issues associated? • What are the recommendations coming out of the audit? What will be the implications for CIDER?
Introduction – CIDER • CIDER: the Centre for Interaction Data Estimation and Research • Based now, principally, at the University of Leeds though software runs at Manchester • Data Support Unit: part of the ESRC-funded UK Census Programme
Access to Census Data and the Census Data Support Units Public access to key statistics, census area statistics and standard tables through National Statistics and NOMIS Provide essentially the same data for 2001, although CDU gives access to data from 1981 and 1991 as well. Census Registration Service (University of Essex) Based at at the University of Manchester. Provides access to census aggregate outputs from 1981 to 2001 through the interface. CeLSIUS, based at the London School of Hygiene and Tropical Medicine, provides access to the Longitudinal Study dataset, comprising linked records for 1% of the population of England and Wales from 1971. Based at the University of St-Andrews, the Scottish LS is a replica of the England and Wales LS, although samples 5.3% of the Scottish Population. Based at Edina at the University of Edinburgh provides access to digital boundary data associated with census outputs, as well as look-up tables for geographical conversion. Based principally in the School of Geography at the University of Leeds. Provides access to interaction datasets throughthe interface. SARs for small samples of households and individuals are supported by the Cathie Marsh Centre for Census and Survey Research (CCSR) based at the University of Manchester.
Introduction - CIDER • Currently we currently administer interaction (flow) data from the 1981, 1991 and 2001 Censuses Migration Commuting
Introduction – CIDER Data Sets and Geographies • 2001 Census: Special Workplace Statistics (SWS) (Levels 1, 2 & 3) • 2001 Census: Special Travel Statistics (STS) (Scotland Levels 1,2 & 3 and Level 2 Scottish postal sectors) • 2001 Census: Special Migration Statistics (SMS) (Levels 1,2 & 3) • Also comparable datasets from 1991 and 1981 • As well as the standard District, Ward and OA geographies available, different aggregations of these basic units, as well as various bespoke geographies are available for different data years
Introduction – CIDER’s Ongoing Objectives CIDER’s objectives of relevance to this presentation: • To expand the system to incorporate a range of UK interaction data sets from outside of the census • To gather/estimate further UK census-based data sets and include them in the system • To undertake research based on the current and future interaction data sets held within the WICID software system
Interaction Datasets in the UK: An Audit Purpose of the Audit: • Before adding new datasets to WICID, we need to know what’s out there! • To identify and evaluate sources of interaction data in the UK that might compliment the current census datasets held in WICID • To make recommendations relating to the inclusion of the most useful datasets in a new, expanded version of WICID
Interaction Datasets in the UK: An Audit • Interaction Data from the Census • Interaction Data from administrative data sources • Interaction Data from survey data sources
Census Tables • KS24: Migration (All people) • ST008: Resident type by age and sex and migration • ST009: Age of household reference person (HRP) and number of dependent children by migration of households • ST010: Household composition by migration of households • TT033: Migration (People): All people in the area and those who have moved from the area in the past year, within the UK
Census Tables Detail from Key Statistics Table 24 at district level
Origin Destination Statistics • Data available at 3 geographical levels: district, ward and output area • Level 3 (OA) data has been severely compromised due to the Small Cell Adjustment Method (SCAM) which has been implemented to preserve confidentiality
Commissioned Tables • Customised output from the 2001 Census may be commissioned from ONS Customer Services – costs involved • All commissioned tables, once paid for by original customer are then listed and made available for free by ONS on their website
Samples of Anonymised Records (SARs) • SARs have been available since the 1991 Census • In 1991 and 2001 a selection of different SAR files were produced, broadly based around either Individuals (2-3% sample) or Households (1% sample) • Opportunity to cross-tabulate variables not available in the main census tables
Samples of Anonymised Records (SARs) Issues: • Usefulness of licensed SAR files is still limited when studying interaction as even with relatively detailed destination geographies, origins are still coded very coarsely • No spatial coding of workplace location • The 2001 Controlled Access Microdata Samples (CAMS) offer more opportunity to study interaction with both origin and destination coded at LAD level, however CAMs small sample size restricts analysis of migrants to only a few variables
Longitudinal Studies (LS) • Three major census based longitudinal studies: • ONS Longitudinal Study of England and Wales • Scottish Longitudinal Study • Northern Ireland Longitudinal Study • There are also a range of non-census birth cohort studies: • The MRC National Survey of Health and Development (NSHD) • The National Child Development Study (NCDS) • 1970 British Cohort Study (BCS1970) • Millennium Cohort Study • New UK Household Longitudinal Study
Longitudinal Studies (LS) • Census based longitudinal studies tend to ask the same questions with each wave. Birth cohort studies often ask a core of repeated questions with a selection of questions which change with each wave • Very useful for studying migration patterns over the lifecourse Issues: • Potentially disclosive nature of longitudinal data means access is often strictly controlled • Spatial information is limited
NHSCR Data for England and Wales • No compulsory system in the UK to record movement of population • National Health Service Central Register (NHSCR) records patients who move and change their doctor • From this proxy data, estimates can be made of national migratory movements
NHSCR data for England and Wales Advantages NHSCR data has over the census • Measure of all moves that take place during a period – unlike census • They include students who register with a doctor or health service facility in the area of their further or higher education institution (unlike pre 2001 censuses) • Their quarterly availability means that they provide a continuous record of migration over time • Comprehensive as individuals are given an NHS record at birth and those in private health schemes usually maintain their NHS registration during their lives
NHSCR Data for England and Wales Disadvantages NHSCR data has when compared with the census • Only age and sex of attributes for individuals • The propensity to register with a new doctor will vary between different migrant subgroups, e.g. young males and pregnant women • They only provide data on flows between relatively coarse spatial units (FHSAs/HAs) • Some short-distance moves missed • Individual members of armed forces are only recorded as entries to and exits from the services - only the origin of recruits and the destination of those discharged are recorded • Long-stay psychiatric patients and prisoners are excluded altogether
NHSCR Data for England and Wales NHSCR data availability • Individual anonymised records or Primary Unit Data (PUD) have been available since the early 1980s • The TIMMIG system (TIMe series MIGration) developed at Leeds (Rees and Duke-Williams, 1993) merged quarterly data into annual blocks to create time series from 1975 to 1998 • NHSCR data availability post-1998 is more problematic • Between 1998 and 2001 - change from data being produced at the FHSA geography to the new HA geography. • Regional flows are freely available from the ONS, however flows between Health Authorities require special request to the ONS
NHSCR Data for Scotland and Northern Ireland • Little if any attempt to produce a consistent set of migration estimates across the whole of the UK • At best ONS produces flows to and from Scotland and Northern Ireland as a whole and HAs in England and Wales • Whilst more detailed in and out flow estimates have been produced, flow matrices for areas both within the countries and between these countries and the rest of the UK are not made available by ONS, NISRA or GRO Scotland
Pupil Level Annual School Census (PLASC) • Records, amongst other things, the home location of pupils and the schools they attend - potential for studying school commute • Results from the PLASC are held by the DfES - the National Pupil Database (established in 2002) contains much of this information • There is potential to track pupils and their movements over time, however there is no indication of a system that is being put in place to process this interaction data from the PLASC
Higher Education Statistics Agency (HESA) data • The Higher Education Statistics Agency (HESA) - collects and provides data on students and staff in HEIs as well destinations for HE graduates • There are ‘student’ and ‘first destination’ datasets • HESA holds postcodes for the permanent residence of students prior to entry to an HEI and the postcode of the administrative headquarters of the HEI attended • Potential for studying student migrations in great detail
Administrative Interaction Data Sources – Hospital Episode Statistics (HES) data • HES data contain information on each hospital episode experienced by a patient • Data are available for every financial year from 1989-90 onwards • Huge variety variables for each patient including location of residence and treatment • Potential for studying ‘commute to hospital’
Administrative Interaction Data Sources: Other Sources • Worker Registration Scheme • National Insurance Number Statistics (NINo) • Home Office asylum seekers and visitor switchers • Estimates of migration between the UK and the Irish Republic • Home Office • Eurostat • United Nations • Customer databases
Labour Force Survey (LFS - Great Britain and Northern Ireland) • GB LFS running since 1973 (in its current format since 1992). Surveys households on a range of socio-demographic variables • Primary unit data available to download from the Economic and Social Data Service (ESDS) enabling users to create their own inter-regional flow matrices
Labour Force Survey (Great Britain) Example of inter-regional flow matrix derived from the LFS
Labour Force Survey (Great Britain) • A special licence dataset is also available which is referenced to local authority level, although access is very restricted • It is also possible to derive an immigration matrix of foreign origins to regions • NI LFS broadly comparable to GB LFS although far less useful for deriving flows
International Passenger Survey (IPS) • Touted as ‘the richest source of information on international migration’, the IPS samples around 250,000 people entering and leaving the main British air and sea ports • IPS is the primary source of information for the Government’s Total International Migration (TIM) estimates Issues: • Seasonally variable sample survey • Misses many asylum seekers and their dependents • Only provides information on the intentions of respondents rather than their final actions • Anyone using primary unit data from the ESDS needs to be especially wary of seasonal adjustment issues
General Household Survey (GHS) and Northern Ireland Continuous Household Survey (CHS) • In existence since 1971 • Sample usually 8,000-10,000 households, (around 15,000-20,000 respondents) • Has always featured a question on the amount of time each respondent has lived at a current address Issues: • It is not possible to derive internal migration matrices from any questions in GHS • The Northern Ireland Continuous Household Survey (CHS) is related to the GHS, although 1983 is the only year where any measure of population migration can be derived
Integrated Household Survey (formerly Continuous Population Survey) • Fieldwork for the Integrated Household Survey (IHS) is scheduled to start in January 2008 • This survey will integrate several other ONS surveys including the Labour Force Survey (LFS), General Household Survey (GHS), Expenditure and Food Survey (EFS), the Annual Population Survey (formerly the NeSS Survey) and Omnibus Survey (OMN) • Reasonable to assume similar measures of migration to those derived from the LFS will be available from the HIS
National Travel Survey (NTS) • First commissioned in 1965/66, the National Travel Survey (NTS) has, since then, provided periodic snapshots of British travel behaviour • Data are only available from 1972 onwards, with continuous data only available from 1988 • Currently sampling around 16,000 addresses in Great Britain annually, Primary unit data are available to download for selected periods since 1972 from the ESDS • Inter-regional matrices can be derived from the primary unit data
Recommendations coming out of the Audit… Data should be included in WICID from the following four sources: • 2001 Census: the large and more complex matrices of migration and commuting flows commissioned from ONS that have national coverage at district and sub-district spatial scales • NHSCR: annual flows, from 1975 to 1998, of NHSCR patient re-registration movements between 100 FHSA-based zones, disaggregated by age and sex; and annual flows, from 1998/99 onwards, of NHS patients movements between HAs, disaggregated by age and sex
Recommendations coming out of the Audit… • HESA: annual flows, from 2001 onwards, of student movements between MLSOA of parental domicile and HEI, disaggregated by various characteristics • NHS IC: annual flows, from 2001 onwards, of hospital patients from LLSOA or MLSOA of residence to hospital, disaggregated by various attributes
Implications for CIDER • CIDER are currently in negotiation with the custodians of our targeted data sets to see if incorporation of the data into WICID is possible • All current indications are positive, however due to the differing availability and cost of the data, it is likely that the acquisition and incorporation of some data will happen before others • For some datasets, CIDER may need to secure additional funding before we are able to include it in the WICID system
Thank you Adam Dennett, Centre for Interaction Data Estimation and Research, School of Geography, University of Leeds a.r.dennett@leeds.ac.uk http://www.geog.leeds.ac.uk/people/a.dennett/ For the full audit: http://www.geog.leeds.ac.uk/wpapers/index.html