180 likes | 273 Views
Best Practices in Data Dissemination Public Use Data files (PUF). Mohammed Sahmoud Programmer & Web Developer Dissemination & Documentation Department msahmoud@pcbs.gov.ps. September, 2013. PCBS’ dissemination Strategy. The Core. The PUF (digital files). The Proactive Dissemination.
E N D
Best Practices in Data Dissemination Public Use Data files (PUF) Mohammed Sahmoud Programmer & Web Developer Dissemination & Documentation Department msahmoud@pcbs.gov.ps September, 2013
PCBS’ dissemination Strategy The Core The PUF (digital files) The Proactive Dissemination The Social Responsibility User Producer Dialogue
PCBS’ dissemination Strategy • As part of PCBS dissemination strategy, the focus is on empowering users through: • Micro data for public use (PUF) • Micro data for special use • Statistical library Empower users
PCBS’ dissemination Strategy • Census micro data is available for public use for everyone with detailed information on the contents posted on PCBS’ web site • Micro data usage is regulated by special agreement and a license: Site, personal, and group license • All local universities have free copies of PCBS’ digital data files including the census
What is PUF: • Data in digital formats based on surveys and censuses as part of user empowerment • Data is treated to ensure the preservation of data confidentiality • Data is documented in user friendly manner along with the corresponding metadata to enable effective utilization of data by users • Use of data is regulated by agreement and license between the user and the PCBS
Why to PUF: • Break the rigid relation between surveys and type of output from 1-1 relation to 1-many: One surveys gives variety of outputs • Empowerusers mainly researchers, academic institutions, post graduate students • Data Feeder for on-line statistical databases as a By Product Output (PUF as input to statistical databases) • Reduce the burden off PCBS – Reduce data requests for detailed and customized data
The PUF System Input Process Output Survey Data • Public Use Data File (ASCII; SPSS) • Documentation (User Manual) Treatment of data according to predefined standards Census Data Methodologies Balancing between user’s needs and data confidentiality
The beginning of the PUF experience……. • Census 1997 as learning experience for the whole organization • Final results disseminated in nine months from census enumeration • Census was the first experiment in preparing microdata. Since then, all of PCBS surveys are disseminated in digital formats
Output of Census 1997 PLUS 20% sample as PUF
Census 1997 – The characteristics of the PUF • The micro data file was a sample with the following characteristics: • The sampling frame consists of a list of all households that have been counted during the population, housing and establishment census 1997. These households cover all households living, during the census, in the West Bank and Gaza Strip except those living in the annexed part of Jerusalem. • Regarding sample design, a systematic random sample of 20% of the households is selected, then all household members of the selected households are selected. • Regarding sampling weights, they are calculated and added to the data file, and they should be used when calculating estimates from the data file.
The PUF of Census 1997 – Selected International Publications • Taraki L (editor). Living Palestine. Family Survival, Resistance, and Mobility under Occupation. Syracuse University Press, New York 2006. • Hill, AG, CJ Clark, IM Lubbad and C Bruderlein (2006) Hope and despair over health in Gaza. British Medical Journal 333 (21):845-6. • Lubbad, I, CJ Clark and AG Hill (2006) Population Projections for Socio-economic Development in the Gaza Strip, Harvard Program on Humanitarian Policy and Conflict Research. • Hill, AG. Demographic and health prospects in the Occupied Palestinian Territory. A Congressional Briefing on Changing Demographics of the Middle East: How Policies and Programs Affect Stability in the Region sponsored by the Office of Representative Danny Davis (D-IL), the Office of Representative Rush Holt (D-NJ), The Office of Representative Mark Kirk (R-IL), Rayburn Building,
The PUF of Census 2007 – The approach • First step: surveying data users who used PUF 1997 about their views and feedback to incorporate their needs into the PUF of Census 2007 • Some of the comments received by users…………. • Our demand is for digital files which contain all of the data for particular locales, especially cities. This is because we have noted varied urbanities emerging in the OPT, ranging from Ramallah to Hebron and Gaza, which are important to study in detail in order to assess what is currently taking place now, and to plan for the future • Census micro data was like a dream come true for us as researchers • To have census digital data files posted on our intranet at the university for utilization by our graduate students is something that we are very proud of.
The PUF of Census 2007 – The approach • Second step: Technical committee to prepare the PUF • The PUF was prepared in 2009 • The Sample is 20% • Data available in ASCII and SPSS formats • Since 2009, it is available for PUBLIC USE • Provided to IPUMS projects so it is available for users worldwide
The PUF is compulsory output of every survey • More than 190 PUF data sets based on surveys and censuses • The closure of survey’s project not complete without the PUF • PUF release date is specified in PCBS statisticalcalendar
PCBS’ PUF – The Web Portal : http://www.pcbs.gov.ps/site/601/default.aspx
Digital data PUF (PUF++ (Researchers Standard PUF More variables More details (Less grouping of variables) Standard; All users
Best Practices in Data Dissemination Public Use Data files (PUF) September, 2013 Thank You