Is this thing on?
The SARs and Nesstar Sam Smith Data Interface Developer Samples of Anonymised Records www.ccsr.ac.uk/sars/
About the data: Samples of Anonymised Records • Microdata from the 1991 & 2001 censuses • Currently 5 files, each of up to 2 million cases and up to 150+ variables • A 5 million case file is coming soon (with fewer variables)
Nesstar Suite as a whole • It’s reliable. • It works. • Enough said.
Nesstar Publisher • Keeper of (most of) our metadata for files. • Checks consistency. • Import and export facilities: the single biggest reason why we use it so much.
Can we do…? Yes • We often get unpredictable bits of information to add, or changes that we need to make. • The ability to export the metadata to a text editor or Perl script, and import it again, is valuable. • As is the validation when it comes back in again.
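As a rough illustration of the kind of check that is useful before an edited export goes back in, here is a minimal Python sketch. It is not what Nesstar Publisher does on import; the function name `check_ddi` and the two rules it applies (no duplicate variable names, every variable has a label) are assumptions made for this example. It also assumes a DDI Codebook style layout in which each variable is a `var` element with a `name` attribute and a `labl` child.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def local(tag):
    """Strip any XML namespace so the check works with or without one."""
    return tag.rsplit("}", 1)[-1]


def check_ddi(xml_text):
    """Return a list of human-readable problems found in a DDI export.

    Hypothetical helper: the rules here are illustrative only.
    """
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError as err:
        # A hand edit that breaks the XML is the most basic failure.
        return [f"not well-formed XML: {err}"]

    problems = []
    variables = [el for el in root.iter() if local(el.tag) == "var"]

    # Rule 1 (assumed): variable names must be unique within the file.
    names = Counter(v.get("name") for v in variables)
    for name, count in names.items():
        if count > 1:
            problems.append(f"variable name {name!r} appears {count} times")

    # Rule 2 (assumed): every variable carries a non-empty label.
    for v in variables:
        labels = [c for c in v if local(c.tag) == "labl"]
        if not labels or not (labels[0].text or "").strip():
            problems.append(f"variable {v.get('name')!r} has no label")

    return problems
```

An empty list means the file passed these two checks; anything else is worth fixing in the text editor before importing.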
Export to DDI • We put our DDI on our website so that people using our data can use and reuse it. • More people using our metadata makes it better. People find bizarre bugs. • XML is simple and easy to transform. • People can find what they’re looking for.
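To show how little is needed to reuse a published DDI file, the sketch below reads variable names, labels and category codes with Python's standard library. The `SAMPLE` document is invented for the example and is far smaller than a real export; the element names (`var`, `labl`, `catgry`, `catValu`) follow the DDI Codebook layout, and the helper names are hypothetical.

```python
import xml.etree.ElementTree as ET

# Invented miniature DDI fragment, for illustration only.
SAMPLE = """<codeBook>
  <dataDscr>
    <var name="sex">
      <labl>Sex</labl>
      <catgry><catValu>1</catValu><labl>Male</labl></catgry>
      <catgry><catValu>2</catValu><labl>Female</labl></catgry>
    </var>
    <var name="age"><labl>Age last birthday</labl></var>
  </dataDscr>
</codeBook>"""


def local(tag):
    """Drop an XML namespace prefix, if the file uses one."""
    return tag.rsplit("}", 1)[-1]


def child_text(parent, name):
    """Text of the first direct child called `name`, or '' if absent."""
    for child in parent:
        if local(child.tag) == name:
            return (child.text or "").strip()
    return ""


def read_variables(xml_text):
    """Return one dict per variable: name, label and (value, label) pairs."""
    root = ET.fromstring(xml_text)
    result = []
    for var in root.iter():
        if local(var.tag) != "var":
            continue
        categories = [
            (child_text(cat, "catValu"), child_text(cat, "labl"))
            for cat in var
            if local(cat.tag) == "catgry"
        ]
        result.append({
            "name": var.get("name"),
            "label": child_text(var, "labl"),
            "categories": categories,
        })
    return result


for variable in read_variables(SAMPLE):
    print(variable["name"], "-", variable["label"])
```

Matching on the local element name keeps the sketch usable whether or not a particular DDI version declares a namespace.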
Other uses • Generation of HTML codebooks. • Generation of PDF codebooks. • All from the same DDI master file: • Webview microdata • Online variable lists + metadata • Codebooks for printing • Everything is created from the same files.
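One way to picture HTML codebook generation from a DDI master file is the sketch below. It is not the script used for the SARs codebooks; the function `html_codebook` and the page layout are assumptions for illustration, and it again assumes DDI Codebook style `var`, `labl`, `catgry` and `catValu` elements.

```python
import html
import xml.etree.ElementTree as ET


def local(tag):
    """Drop an XML namespace prefix, if the file uses one."""
    return tag.rsplit("}", 1)[-1]


def first_text(parent, name):
    """Text of the first direct child called `name`, or '' if absent."""
    for child in parent:
        if local(child.tag) == name:
            return (child.text or "").strip()
    return ""


def html_codebook(xml_text, title="Codebook"):
    """Render a very plain HTML codebook from DDI text (hypothetical layout)."""
    root = ET.fromstring(xml_text)
    parts = [
        f"<html><head><title>{html.escape(title)}</title></head><body>",
        f"<h1>{html.escape(title)}</h1>",
    ]
    for var in root.iter():
        if local(var.tag) != "var":
            continue
        # Escape everything taken from the metadata before it reaches the page.
        name = html.escape(var.get("name") or "")
        label = html.escape(first_text(var, "labl"))
        parts.append(f"<h2>{name}: {label}</h2>")
        categories = [c for c in var if local(c.tag) == "catgry"]
        if categories:
            parts.append("<table>")
            for cat in categories:
                value = html.escape(first_text(cat, "catValu"))
                cat_label = html.escape(first_text(cat, "labl"))
                parts.append(f"<tr><td>{value}</td><td>{cat_label}</td></tr>")
            parts.append("</table>")
    parts.append("</body></html>")
    return "\n".join(parts)
```

Because the page is rebuilt from the DDI text each time, a label corrected in the master file shows up in the codebook the next time the function runs.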
Updating Metadata • The process: • Make a change in Publisher. • Save as DDI • Run a script.
What this gives us • Metadata consistency • Ease of update. • Ease of reuse by anyone • A (relatively) quiet life.